Abstract. We show that the Maxwell-Klein-Gordon equations in three dimensions
are globally well-posed in $H^s_x$ in the Coulomb gauge for all
$s > \sqrt{3}/2 \approx 0.866$. This extends previous work of Klainerman-Machedon [24] on
finite energy data $s \geq 1$, and Eardley-Moncrief [11] for still smoother data. We
use the method of almost conservation laws, sometimes called the "I-method",
to construct an almost conserved quantity based on the Hamiltonian, but at
the regularity of $H^s_x$ rather than $H^1_x$. One then uses Strichartz, null form,
and commutator estimates to control the development of this quantity. The
main technical difficulty (compared with other applications of the method of
almost conservation laws) is at low frequencies, because of the poor control
on the $L^2_x$ norm. In an appendix, we demonstrate the equations' relative lack
of smoothing, a property that presents serious difficulties for studying rough
solutions using other known methods.

1. Introduction



1.1. The (MKG-CG) system. Let $\mathbb{R}^{1+3}$ be Minkowski space endowed with the
usual metric $\eta := \mathrm{diag}(-1, 1, 1, 1)$. Let $\phi : \mathbb{R}^{1+3} \to \mathbb{C}$ be a
complex-valued field, and let $A_\alpha : \mathbb{R}^{1+3} \to \mathbb{R}$ be a real one-form. Here and
in what follows, we use Greek indices to denote the indices of Minkowski space,
and Roman indices to denote spatial indices (e.g. $\alpha, \beta, \gamma \in \{0, 1, 2, 3\}$;
$k \in \{1, 2, 3\}$), raised and lowered in the usual manner. One can then think of
$A$ as a $U(1)$ connection, and can define the covariant derivatives $D_\alpha$ by

(1)  $D_\alpha \phi := (\partial_\alpha + iA_\alpha)\phi.$

We can define the curvature $F_{\alpha\beta}$ of the connection $A$ as the real anti-symmetric
tensor

(2)  $F_{\alpha\beta} := \frac{1}{i}[D_\alpha, D_\beta] = \partial_\alpha A_\beta - \partial_\beta A_\alpha.$



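The (massless) Maxwell-Klein-Gordon equations for a complex field $\phi$ and a
one-form $A_\alpha$ are given by

(MKG)  $\partial^\beta F_{\alpha\beta} = \mathrm{Im}(\phi \overline{D_\alpha \phi}); \qquad D_\alpha D^\alpha \phi = 0,$

which are the Euler-Lagrange equations for the Lagrangian

$\int_{\mathbb{R} \times \mathbb{R}^3} \frac{1}{2} D_\alpha \phi \overline{D^\alpha \phi} + \frac{1}{4} F_{\alpha\beta} F^{\alpha\beta} \, dx\,dt.$

Throughout this paper we follow the convention that repeated indices are summed
over their range. (For example, here $\partial^\beta F_{\alpha\beta} := \sum_{\beta=0}^{3} \partial^\beta F_{\alpha\beta}$ for each $\alpha$.)
We split $A$ into the temporal component $A_0$ and the spatial component
$\underline{A} := (A_1, A_2, A_3)$. We similarly split the covariant spacetime gradient $D_\alpha$
into the covariant time derivative $D_0 = \partial_t + iA_0$ and the covariant spatial
gradient $\underline{D} := (D_1, D_2, D_3) = \nabla_x + i\underline{A}$.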
The Maxwell-Klein-Gordon system of equations has the gauge invariance

$\phi \mapsto e^{i\chi}\phi; \qquad A_\alpha \mapsto A_\alpha - \partial_\alpha \chi$

for any (smooth) potential function $\chi : \mathbb{R}^{1+3} \to \mathbb{R}$. From this and some
elementary Hodge theory, one can place this system of equations in the Coulomb
gauge $\nabla_x \cdot \underline{A} = 0$. In this gauge the Maxwell-Klein-Gordon equations become the
following overdetermined elliptic-hyperbolic system of equations (see [24]):



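(3)  $\Delta A_0 = -\mathrm{Im}(\phi \overline{D_0 \phi})$

(4)  $\partial_t \nabla_x A_0 = -(1 - P)\mathrm{Im}(\phi \overline{\nabla_x \phi}) + (1 - P)(\underline{A}|\phi|^2)$

(5)  $\Box \underline{A} = -P\,\mathrm{Im}(\phi \overline{\nabla_x \phi}) + P(\underline{A}|\phi|^2)$

(6)  $\Box \phi = -2i(P\underline{A}) \cdot \nabla_x \phi + 2iA_0 \partial_t \phi + i(\partial_t A_0)\phi + |\underline{A}|^2 \phi - A_0^2 \phi$

(7)  $\nabla_x \cdot \underline{A} = 0;$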

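here $\Box$ is the standard d'Alembertian

$\Box := \partial_\alpha \partial^\alpha = -\partial_t^2 + \Delta$

and $P := (-\Delta)^{-1} d^* d$ is the spatial Leray projection onto divergence-free
vector fields

$P\underline{A} := (-\Delta)^{-1} \nabla_x \times (\nabla_x \times \underline{A}); \qquad (1 - P)\underline{A} = \Delta^{-1} \nabla_x (\nabla_x \cdot \underline{A}).$

Observe that $P$ can be written as a polynomial combination of Riesz transforms

$R_j := |\nabla_x|^{-1} \partial_j$, where $|\nabla_x| := \sqrt{-\Delta}$.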
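
As a sketch of why $P$ is a combination of Riesz transforms, one can compute Fourier symbols starting from the formula for $1 - P$. The convention $\partial_j \leftrightarrow i\xi_j$ is assumed here (it matches the Fourier transform used in Section 2), and the matrix notation $P_{jk}$ is introduced only for this illustration.

```latex
% Symbols: \partial_j <-> i\xi_j, \Delta <-> -|\xi|^2, |\nabla_x| <-> |\xi|
\widehat{R_j f}(\xi) = \frac{i\xi_j}{|\xi|}\,\hat f(\xi),
\qquad
\widehat{R_j R_k f}(\xi) = -\frac{\xi_j \xi_k}{|\xi|^2}\,\hat f(\xi).

% (1-P)\underline{A} = \Delta^{-1}\nabla_x(\nabla_x\cdot\underline{A}), componentwise
% (sum over the repeated index k):
\widehat{\big((1-P)\underline{A}\big)_j}(\xi)
  = \frac{(i\xi_j)(i\xi_k)}{-|\xi|^2}\,\hat A_k(\xi)
  = \frac{\xi_j \xi_k}{|\xi|^2}\,\hat A_k(\xi)
  = -\widehat{R_j R_k A_k}(\xi).

% Hence
(P\underline{A})_j = A_j + R_j R_k A_k,
\qquad
P_{jk} = \delta_{jk} + R_j R_k .
```

Since each Riesz transform is bounded on $L^p_x$ for $1 < p < \infty$, this representation shows that $P$ inherits the same boundedness.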



We refer to the system (3)-(7) collectively as (MKG-CG). We shall write
$\Phi := (A_0, \underline{A}, \phi)$ to denote the entire collection of fields in (MKG-CG), and use
$\underline{\Phi} := (\underline{A}, \phi)$ to isolate the "hyperbolic" or "dynamic" component of these
fields. (From (3) we see that $A_0$ obeys an elliptic equation rather than a
hyperbolic one.)

We can study the Cauchy problem for (MKG-CG) by specifying the initial data^1
$\Phi[0]$. Although we specify initial data for $A_0$, it is essentially redundant
(assuming some mild decay conditions on $A_0$ at infinity) since by (3), (4), (7)
the data $\Phi[0]$ must obey the compatibility conditions

     $\mathrm{div}\,\underline{A}(0) = \mathrm{div}\,\partial_t \underline{A}(0) = 0$

(8)  $\Delta A_0(0) = -\mathrm{Im}(\phi(0) \overline{D_0 \phi(0)})$

     $\partial_t \nabla_x A_0(0) = -(1 - P)\mathrm{Im}(\phi(0) \overline{\underline{D} \phi(0)}).$

In Section 3 we show how these conditions allow $A_0$ to be reconstructed from $\underline{\Phi}$.

Remark 1.2. If we ignore the "elliptic part" of the equations, then heuristically the
system (MKG-CG) takes the schematic form^2

(9)  $\Box \Phi = O(\Phi \nabla_{x,t} \Phi) + O(\Phi \Phi \Phi)$

although this caricature does not capture the full structure of the equation. Indeed,
in [24] it was observed that the interplay between (7) and the bilinear terms in (5),
(6) allow us to write the most important components of the quadratic portion
$O(\Phi \nabla_{x,t} \Phi)$ of the nonlinearity in terms of the null forms
$Q_{jk}(\varphi, \psi) := \partial_j \varphi\, \partial_k \psi - \partial_k \varphi\, \partial_j \psi$. We shall return to this point in Section 6.

^1 Here and in the sequel we use $\phi[t]$ as short-hand for $(\phi(t), \phi_t(t))$.

^2 The $O()$ notation is made precise in the second paragraph of Section 6 below. For now, the
notation can be taken to mean "terms that look like".

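Remark 1.3. The system (MKG-CG) is invariant under the scaling

(10)  $\Phi(t, x) \mapsto \frac{1}{\lambda} \Phi\left(\frac{t}{\lambda}, \frac{x}{\lambda}\right)$

which suggests that the natural scale-invariant space for the initial data $\Phi[0]$ is

$\dot{H}^{1/2}_x \times \dot{H}^{-1/2}_x.$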
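
The scaling exponent can be checked by a short computation on the Fourier side. The following is a sketch for a single function $f$ on $\mathbb{R}^3$; the names $f_\lambda$ and $g_\lambda$ are introduced here for illustration, and the Fourier normalization is immaterial since it appears on both sides.

```latex
% Position component: f_\lambda(x) := \lambda^{-1} f(x/\lambda)
\widehat{f_\lambda}(\xi) = \lambda^{2}\,\hat f(\lambda\xi),
\qquad
\| f_\lambda \|_{\dot H^{s}_x}^2
  = \int |\xi|^{2s}\,\lambda^{4}\,|\hat f(\lambda\xi)|^2\,d\xi
  = \lambda^{1-2s}\,\| f \|_{\dot H^{s}_x}^2 .

% Velocity component: g_\lambda(x) := \lambda^{-2} g(x/\lambda)
\widehat{g_\lambda}(\xi) = \lambda\,\hat g(\lambda\xi),
\qquad
\| g_\lambda \|_{\dot H^{s-1}_x}^2
  = \int |\xi|^{2s-2}\,\lambda^{2}\,|\hat g(\lambda\xi)|^2\,d\xi
  = \lambda^{1-2s}\,\| g \|_{\dot H^{s-1}_x}^2 .

% Both norms scale like \lambda^{1/2 - s}: invariant exactly at s = 1/2,
% decreasing in \lambda for s > 1/2 (subcritical), and increasing for
% s < 1/2 (supercritical; e.g. the L^2_x norm, s = 0, grows like \lambda^{1/2}).
```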

For any^3 $s > 1/2$, define the norm $[H^s]$ on initial data^4 by

(11)  $\|\Phi[0]\|_{[H^s]} := \|\nabla_{x,t} \Phi(0)\|_{H^{s-1}_x} + \|\underline{\Phi}(0)\|_{H^s_x},$

where $H^s_x$ is the usual inhomogeneous Sobolev norm $\|u\|_{H^s_x} := \|\langle \nabla \rangle^s u\|_{L^2_x}$, not to be
confused with the homogeneous Sobolev norm $\|u\|_{\dot{H}^s_x} = \| |\nabla_x|^s u\|_{L^2_x}$. Here and in
the sequel we use $\langle x \rangle$ as short-hand for $(1 + |x|^2)^{1/2}$. We refer to $[H^1]$ in particular
as the energy class.

Remark 1.4. The energy class $[H^1]$ is almost the $H^1_x \times L^2_x$ norm of $\Phi[0]$, a difference
being that we do not place $A_0$ in $L^2_x$. Indeed, even if $\underline{\Phi}$ is smooth and compactly
supported, we see from (3) and the fundamental solution of the Laplacian that $A_0$
might only decay as fast as $O(1/|x|)$ at infinity, which is not in $L^2_x$. Thus we see that
the non-local nature of the Coulomb gauge causes some difficulties^5 with the low
frequency component of $A_0$. Although these difficulties will cause much technical
inconvenience, they are not the main enemy in the low regularity theory, and we
recommend that the reader ignore all mention of low frequency issues at a first
reading. In particular, the reader should initially ignore the technical distinctions
between the inhomogeneous Sobolev norm $H^s_x$ and the homogeneous counterpart
$\dot{H}^s_x$.

1.5. Prior results. The following local and global well-posedness results are known.
Regarding global solutions, if the initial data $\Phi[0]$ is smooth and obeys the
compatibility conditions (8), then there is a unique smooth global solution of
(MKG-CG) with initial data $\Phi[0]$; see [11]. Furthermore, one has global
well-posedness in the energy class and above:

Theorem 1.6. [24] Let $s \geq 1$, and $\Phi[0] \in [H^s]$ obey the compatibility conditions
(8). Then there exists a global solution $\Phi$ to (MKG-CG) with initial data $\Phi[0]$.
Furthermore, for each time $t$ the solution map $\Phi[0] \mapsto \Phi[t]$ is a continuous map
from $[H^s]$ to $[H^s]$, and we have the bound

(12)  $\|\Phi[T]\|_{[H^s]} \leq C(s, \|\Phi[0]\|_{[H^s]}) \langle T \rangle^{C(s)}$

for all $T > 0$, where $C(s, \|\Phi[0]\|_{[H^s]})$ and $C(s)$ are positive quantities depending
only on $s$, $\|\Phi[0]\|_{[H^s]}$ and on $s$ respectively.

^3 We remark that the condition $s > 1/2$, besides being natural from scaling considerations, is
also important for making sense of the non-linearity $O(\Phi \nabla_{x,t} \Phi)$ and the compatibility conditions
(8), since when $s < 1/2$ one cannot make sense of a product of an $H^s_x$ function and an $H^{s-1}_x$
function, even in the sense of distributions. We do not consider here the delicate issue of what
happens at the critical regularity $s = 1/2$.

^4 As is usual for wave equations, regularity in time and regularity in space are essentially
equivalent, so we always expect $\partial_t \Phi$ to have one lower degree of spatial regularity than $\Phi$.

^5 On the other hand, we do not have these issues with the time derivative $\partial_t A_0$. Indeed, if
$\underline{\Phi}(0) \in H^1_x$ then from (4) and some Sobolev embedding we see that $\nabla_x \partial_t A_0 \in L^{6/5}_x$, and hence
that $\partial_t A_0 \in L^2_x$.

One of the key ingredients in obtaining global well-posedness (as opposed to just 
local well-posedness) is the well-known fact that the flow (MKG-CG) preserves the 
Hamiltonian 

(13)  $H[\Phi[t]] := \int \frac{1}{2}|\nabla_x A_0(t) - \partial_t \underline{A}(t)|^2 + \frac{1}{2}|\nabla_x \times \underline{A}(t)|^2 + \frac{1}{2}|D_0 \phi(t)|^2 + \frac{1}{2}|\underline{D} \phi(t)|^2 \, dx.$

This Hamiltonian is clearly non-negative. In the Coulomb gauge $\nabla_x \cdot \underline{A} = 0$ the
Hamiltonian turns out to be roughly equivalent to $\|\Phi[t]\|^2_{[H^1]}$; see Section 4.

Theorem 1.6 includes also local well-posedness in $[H^s]$ in the range $s \geq 1$. This
condition for local well-posedness has been lowered to $s > 3/4$ by Cuccagna [10]
(see also Theorem 5.1 below), and down to the near-optimal value of $s > 1/2$ in [30]
(see also a similar result for a model problem in [25], and [37], [53] for analogous
results in higher dimensions). Our results here shall rely primarily on the local
theory in [10] and not on the more sophisticated techniques in [30].

1.7. Main result. The purpose of this paper is to consider the corresponding 
question of global well-posedness below the energy class. Our main result is 

Theorem 1.8. Theorem 1.6 also holds in the range $1 > s > \sqrt{3}/2$.

Remark 1.9. This result was announced previously by the first and third authors 
in the smaller range 1 > s > 7/8. Prior to the finalization of this paper, we had 
announced this result for the improved range s > 5/6, but some of the estimates 
used in that argument turned out to be incorrect. 

These results can be compared with the theory for the nonlinear wave equation 
(NLW)  $\Box \phi = |\phi|^2 \phi$

(compare with (6)). This equation has the same scaling (10) as (MKG-CG) but
is simpler due to the lack of derivatives in the nonlinearity. For this equation one 
has local well-posedness all the way down to the critical regularity $s \geq 1/2$ (though
at $s = 1/2$ the time of existence depends on the profile of the data, and not just
its norm), and global well-posedness for small $\dot{H}^{1/2}$ data, see e.g. [2H] HH EE]. For
large data global well-posedness is known for $s > 13/18$ [21], extending previous work
that had gotten to $s > 3/4$ [17], [13], [2]. Since the local well-posedness theory for
(MKG-CG) has been improved in [30] to nearly match that for (NLW), one might
therefore hope to improve the results here to, say s > 3/4, although this is by no 
means automatic and we will not do so here given that the argument is already 
quite lengthy. 

Our proof proceeds by the method of almost conservation laws, sometimes called 
the "I-method" , introduced in [16] and in the earliest versions of the present work 
(see e.g. [3] for use of the method in a more straightforward context than the 
present one). The basic idea is to introduce a special smoothing operator $I = I_N$
of order $1 - s$ depending on a large parameter $N$, and then consider the quantity
$H[I\Phi[t]]$, which turns out to be finite (but large) for $\Phi[t] \in [H^s]$. When $s = 1$
then $I$ is the identity and $H[I\Phi[t]]$ is exactly conserved. When $s < 1$ we do not
have exact conservation, but we will be able to show (using a modified local
well-posedness theory) that $H[I\Phi[t]]$ is "almost conserved" in that its derivative is
very small (indeed, it will be bounded by a negative power of $N$). This will allow us
to control the solution for long times (a positive power of $N$). Letting $N$ go to
infinity we obtain the result. Unfortunately the operator $I$ maps $H^s_x$ to $H^1_x$ with
a large operator norm (like $O(N^{1-s})$), and when $s$ is small this loss of $N^{1-s}$ can
overwhelm the almost-conservation of $H[I\Phi[t]]$, which is why we have the rather
artificial restriction $s > \sqrt{3}/2$. Subsequent refinements of the "I-method" in
[5]-[9] suggest that this restriction can be lowered by adding additional "damping
correction" terms to $H[I\Phi[t]]$ to reduce the size of the derivative, but we will not
pursue these matters here. Certainly we do not expect $\sqrt{3}/2$ to be the sharp
threshold of global well-posedness. (For instance, many, though unfortunately not
all, of the components of the argument are also valid in the regime $s > 5/6$, and
some parts are even valid in the range $s > 3/4$.)
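
To make the role of $N^{1-s}$ concrete, here is a sketch of the kind of multiplier commonly used for $I$ in the I-method literature. This is an assumption for illustration only: the precise definition of $I$ used in this paper is not given in this section, and the symbol name $m$ is hypothetical.

```latex
% A standard choice (assumed): a smooth, radial Fourier multiplier,
% non-increasing in |\xi|, with
\widehat{I u}(\xi) := m(\xi)\,\hat u(\xi),
\qquad
m(\xi) :=
\begin{cases}
1, & |\xi| \le N,\\[2pt]
(N/|\xi|)^{1-s}, & |\xi| \ge 2N .
\end{cases}

% For N \ge 1 one has 1 \lesssim \langle\xi\rangle^{1-s} m(\xi) \lesssim N^{1-s}, hence
\| u \|_{H^s_x} \;\lesssim\; \| I u \|_{H^1_x} \;\lesssim\; N^{1-s}\,\| u \|_{H^s_x}.

% When s = 1 the multiplier is identically 1, so I is the identity.
```

Under this choice the left inequality says that controlling $Iu$ in $H^1_x$ controls $u$ in $H^s_x$, while the right inequality exhibits the $O(N^{1-s})$ loss mentioned above.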

These results are similar to those of the earlier work of Bourgain and later authors
in obtaining global well-posedness for nonlinear wave, Schrödinger and KdV-type
equations below the energy norm. However the methods are slightly different;
instead of using a smoothing operator $I$, the method of Bourgain relies on truncating
the solution $u$ at frequency $N$ into a low frequency component and a high frequency
component, and controlling the evolution of the two components separately (except
for some periodic adjustments at regular intervals). This approach gives much
better control on the solution (for instance, the method shows the high frequencies
behave almost like the linear flow), but requires "extra smoothing estimates" on the
nonlinear component of the solution, in particular placing that component in the
energy class even when the solution is in a rougher Sobolev space. For the equation
(MKG-CG) these extra smoothing estimates are not available for the worst term in
the nonlinearity, namely $(P\underline{A}) \cdot \nabla_x \phi$, mainly because of the derivative $\nabla_x$; indeed
in the appendix we will give an argument that shows that this extra smoothing
fails for $[H^s]$ solutions for any $s < 1$. Fortunately, the I-method can circumvent
this problem by using commutator estimates as a substitute for extra smoothing
estimates. See [38, §3.9] for some further discussion.



1.10. Organisation of the paper. After setting some general notation in Section 2,
we describe some useful elliptic estimates for $A_0$ in Section 3. We are then able to
investigate the Hamiltonian in Section 4 and show that this Hamiltonian largely
controls the $[H^1]$ norm of $\Phi$. This allows us to begin the proof of Theorem 1.8
in Section 5, where we reduce matters to showing a standard local well-posedness
result (Theorem 5.1) as well as an almost conservation law (Proposition 5.4) for
the modified Hamiltonian $H[I\Phi]$. To achieve these tasks, we write the (MKG-CG)
equation in Section 6 in a more schematic form which will be more convenient to
manipulate. After recalling some $X^{s,b}$ theory in Sections 7 and 8, we are then quickly
able to establish the local well-posedness result in Section 9. To prove the almost
conservation law, we will then need a modified local well-posedness result
(Proposition 10.1), established in Section 10. We then differentiate the Hamiltonian in
Section 11, leading to a number of commutator terms we need to control. The
terms arising from cubic nonlinearities are relatively easy and are dealt with in
Section 12. The terms arising from bilinear nonlinearities are rather complicated
and we shall deal with them en masse using some specialized notation in Section
13, which allows us to deal with the non-null form bilinear terms in Section 14 and
the null form terms in Section 15.

Finally, in the appendix we demonstrate why the system (MKG-CG) does not have
the smoothing effect necessary for Bourgain's Fourier truncation method to be
applicable.
2. Notation

We fix an exponent $s$, which will usually be in the range $\sqrt{3}/2 < s < 1$. We use
$C$ to denote various large constants depending on $s$ and on some other quantities
which we will indicate in the sequel. We use $A \lesssim B$ (or $A = O(B)$) to denote the
estimate $A \leq CB$ and $A \ll B$ to denote the estimate $A \leq C^{-1}B$. We use $a+$ and
$a-$ to denote expressions of the form $a + \varepsilon$ and $a - \varepsilon$, where $0 < \varepsilon = \varepsilon(s) \ll 1$
denotes a small number; the implicit constants $C$ referred to above are allowed to
depend on $\varepsilon$. Note that $a$ may be negative; thus for instance $-\frac{1}{2}+ = -(\frac{1}{2}-)$ is a
number slightly larger than $-\frac{1}{2}$.

Given any Banach space $X$ and any injective linear operator $T : X \to Y$, let $TX$
denote the Banach space $\{Tu : u \in X\}$ with norm

$\|u\|_{TX} := \|T^{-1}u\|_X.$

If $X$ is a Banach space, we shall use $B(X)$ to denote the unit ball
$B(X) := \{f \in X : \|f\|_X \leq 1\}$. Thus $rB(X) = B(rX)$ is the ball of radius $r$,

$B(X) + B(Y) = B(X + Y) = \{f + g : f \in B(X), g \in B(Y)\}$

is the unit ball of the Banach space $X + Y$, $TB(X) = B(TX)$ is the unit ball of
$TX$, etc. We use the embedding notation

$X \subseteq Y$

to denote the estimate $B(X) \subseteq CB(Y)$, or equivalently that $f \in Y$ and
$\|f\|_Y \lesssim \|f\|_X$ for all $f \in X$. We will need this rather unusual notation due to our
reliance on compound spaces $X + Y$ in some of our later arguments.
We use $\hat{\phi}$ to denote the spatial Fourier transform

$\hat{\phi}(t, \xi) := \int_{\mathbb{R}^3} e^{-ix \cdot \xi} \phi(t, x) \, dx$

and define fractional derivative operators in the usual manner:

$\widehat{|\nabla_x|^s \phi}(t, \xi) := |\xi|^s \hat{\phi}(t, \xi); \qquad \widehat{\langle \nabla_x \rangle^s \phi}(t, \xi) := \langle \xi \rangle^s \hat{\phi}(t, \xi).$

We recall the Sobolev multiplication laws on $\mathbb{R}^3$. Specifically, we have

(14)  $\|uv\|_{H^s_x} \lesssim \|u\|_{H^{s_1}_x} \|v\|_{H^{s_2}_x}$

whenever $s_1 + s_2 \geq 0$, $s \leq \min(s_1, s_2)$ and $s < s_1 + s_2 - \frac{3}{2}$. See e.g. [37]. Of course,
the implicit constant here depends on $s_1$, $s_2$, $s$. A special case of this inequality is

Lemma 2.1. If $s > 3/4$, then $\|uv\|_{\dot{H}^{-1}_x} \lesssim \|u\|_{H^s_x} \|v\|_{H^{s-1}_x}$.



Proof. By duality it suffices to show that $\|uw\|_{H^{1-s}_x} \lesssim \|u\|_{H^s_x} \|w\|_{\dot{H}^1_x}$. If $\hat{w}$ is
supported in the region $|\xi| \gtrsim 1$ then $\dot{H}^1_x$ is equivalent to $H^1_x$ and the claim follows
from (14) (since $s > 3/4$). Thus we assume that $\hat{w}$ is supported on the region $|\xi| \ll 1$.

Taking the fractional derivative $\langle \nabla_x \rangle^{1-s}$ and applying the fractional Leibnitz rule
(taking Fourier transforms and assuming $\hat{u}$, $\hat{w}$ to be real and non-negative if desired)
we reduce to showing that

$\|u \langle \nabla_x \rangle^{1-s} w\|_{L^2_x} \lesssim \|u\|_{H^s_x} \|w\|_{\dot{H}^1_x}$

and

$\|(\langle \nabla_x \rangle^{1-s} u) w\|_{L^2_x} \lesssim \|u\|_{H^s_x} \|w\|_{\dot{H}^1_x}.$

Since $w$ only has low frequencies, the operator $\langle \nabla_x \rangle^{1-s}$ is harmless when applied
to $w$, and it will suffice to prove the latter inequality. But we can use Hölder to
measure $\langle \nabla_x \rangle^{1-s} u$ in $L^3_x$ and $w$ in $L^6_x$, and the claim follows from Sobolev and the
assumption $3/4 < s < 1$. □
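
The exponent bookkeeping in this proof can be written out explicitly. The sketch below assumes that the conditions in (14) are read as $s_1 + s_2 \geq 0$, $s \leq \min(s_1, s_2)$ and $s < s_1 + s_2 - \frac{3}{2}$; the strictness of the last condition is what produces the open threshold $s > 3/4$.

```latex
% High-frequency case: apply (14) with (s, s_1, s_2) replaced by (1-s, s, 1).
\| u w \|_{H^{1-s}_x} \lesssim \| u \|_{H^{s}_x}\,\| w \|_{H^{1}_x}
\quad\text{requires}\quad
\begin{cases}
s + 1 \ge 0, \\
1 - s \le \min(s, 1) \iff s \ge 1/2, \\
1 - s < s + 1 - \tfrac{3}{2} \iff s > 3/4 .
\end{cases}

% Low-frequency case: H\"older with 1/2 = 1/3 + 1/6, then Sobolev embedding.
\| (\langle\nabla_x\rangle^{1-s} u)\, w \|_{L^2_x}
 \le \| \langle\nabla_x\rangle^{1-s} u \|_{L^3_x}\,\| w \|_{L^6_x}
 \lesssim \| u \|_{H^{s}_x}\,\| w \|_{\dot H^{1}_x},

% using \langle\nabla_x\rangle^{1-s} u \in H^{2s-1}_x \subseteq L^3_x,
% which needs 2s - 1 \ge 1/2 (i.e. s \ge 3/4), and \dot H^1_x \subseteq L^6_x.
```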



We shall also use the Sobolev embeddings $\dot{H}^{1/2}_x \subseteq L^3_x$, $\dot{H}^{3/4}_x \subseteq L^4_x$, $\dot{H}^1_x \subseteq L^6_x$, and
$\dot{H}^{3/2-}_x \cap \dot{H}^{3/2+}_x \subseteq L^\infty_x$ extremely frequently in the sequel. Of course these
homogeneous Sobolev embeddings imply various inhomogeneous Sobolev embeddings, e.g.
that $H^s_x \subseteq L^3_x$ whenever $s \geq 1/2$.

As mentioned in the introduction, one of the technical difficulties with (MKG-CG)
is that it is not always possible to control the low frequency portion^6 of the fields
$(A_0, \underline{\Phi})$ satisfactorily in $L^2_x$ norm. To get around this we shall estimate the low
frequencies in other $L^p_x$ norms. In this section we develop some of the theory of
frequency-localized Lebesgue spaces.

^6 In the introduction, it was only $A_0$ which had difficulty getting into the $L^2$ norm, and the
$[H^s]$ norm allowed us to control $\underline{\Phi}$ in $L^2$. However, for the global existence argument we shall
need to rescale the fields $(A_0, \underline{\Phi})$ by a large dilation factor $\lambda$. This rescaling is needed to make
the (subcritical) Hamiltonian small, but it also makes the (supercritical) $L^2$ norm large. While
it is possible to continue using the $L^2$ norm, it leads to inferior numerology (in particular, the
range of possible $s$ is greatly reduced) so we shall avoid doing so, using other (non-supercritical)
Lebesgue spaces such as $L^6_x$ as substitutes.
Definition 2.2. If $1 \leq p \leq \infty$ and $R > 0$, we define the space $L^p_R$ to be the subspace
of $L^p(\mathbb{R}^3)$ consisting of those functions whose Fourier support is contained in the
ball $|\xi| \leq R$. (We keep the $L^p$ norm structure on this subspace $L^p_R$.)



We will use very specific instances of these spaces such as L^, Li5, and Lj^. 

Observe that if $R$ is bounded, then derivatives are bounded on $L^p_R$:

(15)  $\nabla_x L^p_R \subseteq L^p_R.$

This is clear since $\nabla_x$ is equivalent to a standard symbol of order 0 on frequencies
$|\xi| \lesssim R$. From this and Sobolev embedding (or Bernstein's inequality) we see that

(16)  $L^p_R \subseteq L^q_{R'}$

whenever $p \leq q$ and $R \leq R'$.
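
For reference, the form of Bernstein's inequality behind (16) can be sketched as follows. This is the standard statement on $\mathbb{R}^3$, quoted here as an assumption rather than taken from the text; the implied constant depends on $p$ and $q$ only.

```latex
% Bernstein's inequality on \mathbb{R}^3:
\| f \|_{L^q_x} \;\lesssim_{p,q}\; R^{\,3\left(\frac{1}{p} - \frac{1}{q}\right)}\,\| f \|_{L^p_x}
\qquad\text{whenever } \operatorname{supp}\hat f \subseteq \{ |\xi| \le R \},
\quad 1 \le p \le q \le \infty .

% If R is bounded, the factor R^{3(1/p - 1/q)} is O(1) because 1/p - 1/q \ge 0,
% and \{ |\xi| \le R \} \subseteq \{ |\xi| \le R' \} for R \le R'; together these give (16).

% Example: p = 6, q = \infty, R = 1 gives \| f \|_{L^\infty_x} \lesssim \| f \|_{L^6_x}.
```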

The functions in $L^p_R$ are thus very smooth (in fact, they are analytic). The $p$
exponent thus does not measure regularity, but instead controls the decay at infinity.

From Hölder's inequality we have

(17)  $L^p_R \cdot L^q_{R'} \subseteq L^r_{R+R'}$

whenever $1/r = 1/p + 1/q$, since the frequency support of a product is contained
in the sum of the frequency supports of the factors.

In particular, if $R$ is bounded, then functions in $L^p_R$ are bounded, and so

$L^p_R \cdot L^2_x \subseteq L^2_x.$

From (15) and the Leibnitz rule we thus have

(18)  $L^p_R \cdot H^s_x \subseteq H^s_x$

for all integer $s \geq 0$. By duality this also holds for integer $s \leq 0$. By complex
interpolation this thus holds for all real $s$.

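Finally, we prove an "energy estimate" for the $L^p_R$ spaces. Let us restrict spacetime
to a slab $[t_0 - \delta, t_0 + \delta] \times \mathbb{R}^3$ for some $0 < \delta \ll 1$ and some time $t_0$. Suppose $u$ is such
that $u[t_0] \in L^p_R$ and $\Box u \in L^1_t L^p_R$, with $R$ bounded. Then we have $\nabla_{x,t} u \in C^0_t L^p_R$
with

(19)  $\|\nabla_{x,t} u\|_{C^0_t L^p_R} \lesssim \|u[t_0]\|_{L^p_R} + \|\Box u\|_{L^1_t L^p_R}.$

Indeed, this follows from the Duhamel formula

$u(t) = \cos((t - t_0)\sqrt{-\Delta})\, u(t_0) + \frac{\sin((t - t_0)\sqrt{-\Delta})}{\sqrt{-\Delta}}\, u_t(t_0) - \int_{t_0}^{t} \frac{\sin((t - t')\sqrt{-\Delta})}{\sqrt{-\Delta}}\, \Box u(t') \, dt'$

and the fact that $\nabla_{x,t}$, $\cos((t - t_0)\sqrt{-\Delta})$ and $\frac{\sin((t - t_0)\sqrt{-\Delta})}{\sqrt{-\Delta}}$ are equivalent to symbols
of order 0 for frequencies $\lesssim R$ and times $t \in [t_0 - \delta, t_0 + \delta]$.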
3. The elliptic theory of $A_0$

In this section we develop some elliptic theory for how the connection component
$A_0$ depends on $\phi$ and $\underline{A}$. We shall establish a smoothing effect that allows us to
place $A_0[t]$ in $\dot{H}^1_x \times L^2_x$ even if $\phi[t]$ and $\underline{A}[t]$ are merely in $H^s_x \times H^{s-1}_x$ for some
$3/4 < s < 1$.

The equations (3), (4) for a fixed time $t$ can be rewritten as

(20)  $(-\Delta + |\phi|^2) A_0 = \mathrm{Im}(\phi \overline{\phi_t})$

      $\nabla_x A_{0,t} = -(1 - P)\mathrm{Im}(\phi \overline{\nabla_x \phi}) + (1 - P)(\underline{A}|\phi|^2).$

We view (20) as a linear elliptic system for two unknown fields $A_0$, $A_{0,t}$ in terms
of data $\phi$, $\phi_t$, $\underline{A}$; the $t$ subscript here should be viewed as simply a label, thus $A_{0,t}$
and $\phi_t$ are not being interpreted here as the time derivatives of $A_0$ or $\phi$.

Our main result here is as follows. 

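Proposition 3.1 ($A_0$ estimates). Let $3/4 < s < 1$, let $\phi, \underline{A} \in H^s_x$, and $\phi_t \in H^{s-1}_x$
(we do not assume here that $\phi, \underline{A}, \phi_t$ solve (MKG-CG) or obey any compatibility
conditions). Then there exists unique $A_0 \in \dot{H}^1_x$ and $A_{0,t} \in L^2_x$ obeying (20), and we
also have the bounds

(21)  $\|A_0\|_{\dot{H}^1_x} \lesssim \|\phi \overline{\phi_t}\|_{\dot{H}^{-1}_x}$

and

(22)  $\|A_0\|_{\dot{H}^1_x}, \|A_{0,t}\|_{L^2_x} \lesssim \sum_{j=2}^{3} (\|\phi\|_{H^s_x} + \|\underline{A}\|_{H^s_x} + \|\phi_t\|_{H^{s-1}_x})^j.$

Furthermore, if $\phi', \underline{A}' \in H^s_x$ and $\phi'_t \in H^{s-1}_x$, and $A'_0, A'_{0,t}$ are the associated
solutions to (20), then we have the local Lipschitz bound

(23)  $\|A_0 - A'_0\|_{\dot{H}^1_x} + \|A_{0,t} - A'_{0,t}\|_{L^2_x} \lesssim M^5 (\|\phi - \phi'\|_{H^s_x} + \|\underline{A} - \underline{A}'\|_{H^s_x} + \|\phi_t - \phi'_t\|_{H^{s-1}_x})$

where

(24)  $M := 1 + \|\phi\|_{H^s_x} + \|\underline{A}\|_{H^s_x} + \|\phi_t\|_{H^{s-1}_x} + \|\phi'\|_{H^s_x} + \|\underline{A}'\|_{H^s_x} + \|\phi'_t\|_{H^{s-1}_x}.$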

Remark 3.2. If $\phi$ is suitably small (e.g. if $\|\phi\|_{H^s_x} \ll 1$) then we can iterate away the
linear term $A_0|\phi|^2$, and controlling $A_0$ in terms of $\phi$ is straightforward. However
we do not assume any smallness condition on $\phi$, and so we must proceed with some
care. In particular, we must augment perturbation theory with some variational
methods^7.

Proof. The Schrödinger operator $-\Delta + |\phi|^2$ maps $\dot{H}^1_x$ to $\dot{H}^{-1}_x$ (using (14)), and is
clearly positive definite. From Lemma 2.1 we have $\mathrm{Im}(\phi \overline{\phi_t}) \in \dot{H}^{-1}_x$. From (20) we
thus conclude that $A_0$ is unique as claimed. The uniqueness for $A_{0,t}$ is obvious.

^7 Alternatively, one could localize in space and use standard elliptic regularity theory. However
this requires the usage of local Coulomb gauges, which have their own attendant technicalities,
see e.g. [20], [39].
To prove the remaining claims in the proposition, it suffices by the usual density
arguments to verify the case when $\phi$, $\phi_t$, $\underline{A}$ are smooth and rapidly decreasing in
space, which we shall now assume throughout.

From (20) and standard Euler-Lagrange theory, we see that $A_0$ can now be
constructed as the unique minimizer in $\dot{H}^1_x$ of the convex functional^8

(25)  $L_{\phi,\phi_t}(A_0) := \frac{1}{2} \int_{\mathbb{R}^3} |\nabla_x A_0|^2 + |\phi_t + iA_0 \phi|^2 \, dx.$

This gives existence of $A_0$. The existence of $A_{0,t}$ is clear from Hodge theory, since
the right-hand side of the second equation in (20) is curl-free. We shall now use
this variational formulation to establish the bounds (22), (23).
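
As a consistency check between the functional (25) and the first equation of (20), one can compute the first variation directly. The sketch below assumes the factor $\frac{1}{2}$ in front of the integral in (25), takes $A_0$ real-valued, and uses a real test function $h$ (a name introduced here for illustration).

```latex
% Expand the second term of (25) for real A_0:
|\phi_t + iA_0\phi|^2
  = |\phi_t|^2 + 2A_0\,\mathrm{Im}(\overline{\phi}\,\phi_t) + A_0^2\,|\phi|^2 .

% First variation in a real direction h, followed by integration by parts:
\frac{d}{d\varepsilon}\,L_{\phi,\phi_t}(A_0 + \varepsilon h)\Big|_{\varepsilon = 0}
  = \int_{\mathbb{R}^3} \nabla_x A_0\cdot\nabla_x h
      + h\,\mathrm{Im}(\overline{\phi}\,\phi_t) + A_0\,|\phi|^2\,h \; dx
  = \int_{\mathbb{R}^3} h\,\Big[(-\Delta + |\phi|^2)A_0
      - \mathrm{Im}(\phi\,\overline{\phi_t})\Big]\,dx .

% Vanishing for all h recovers (-\Delta + |\phi|^2) A_0 = \mathrm{Im}(\phi\,\overline{\phi_t}).

% Second variation: \int_{\mathbb{R}^3} |\nabla_x h|^2 + |\phi|^2 h^2 \, dx \ge 0,
% which is the convexity used to obtain a unique minimizer.
```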



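At first glance it seems we are in trouble when $s < 1$ because (25) cannot be
controlled by the $H^s_x \times H^{s-1}_x$ norm of $\phi[t]$. However, we may "renormalize" $L_{\phi,\phi_t}(A_0)$
by defining

$\tilde{L}_{\phi,\phi_t}(A_0) := L_{\phi,\phi_t}(A_0) - \frac{1}{2} \int |\phi_t|^2 = \int \frac{1}{2}|\nabla_x A_0|^2 + A_0\, \mathrm{Im}(\overline{\phi} \phi_t) + \frac{1}{2}|A_0|^2 |\phi|^2.$

In particular, since $\tilde{L}_{\phi,\phi_t}(A_0) \leq \tilde{L}_{\phi,\phi_t}(0) = 0$, we have

$\int |\nabla_x A_0|^2 \lesssim \left| \int A_0\, \mathrm{Im}(\overline{\phi} \phi_t) \right|$

so by Cauchy-Schwarz we have (21). From Lemma 2.1 we see that $A_0$ verifies (22).
The claim for $A_{0,t}$ is much simpler, following easily from Lemma 2.1, Sobolev
and Hölder.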

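It remains to establish (23). We fix $\phi, \underline{A}, \phi_t, \phi', \underline{A}', \phi'_t$ (and hence $A_0, A'_0$). From
(22) we have

(26)  $\|A_0\|_{\dot{H}^1_x} + \|A'_0\|_{\dot{H}^1_x} \lesssim M^2.$

If we write $A'_0 = A_0 + h$ and $A'_{0,t} = A_{0,t} + h_t$, our task is to show that

(27)  $\|h\|_{\dot{H}^1_x} + \|h_t\|_{L^2_x} \lesssim M^5 (\|\phi - \phi'\|_{H^s_x} + \|\underline{A} - \underline{A}'\|_{H^s_x} + \|\phi_t - \phi'_t\|_{H^{s-1}_x}).$

We begin with the estimation of $h$. From the variational characterisation of
$A'_0$ we have

$\tilde{L}_{\phi',\phi'_t}(A'_0) - \tilde{L}_{\phi',\phi'_t}(A_0) \leq 0$

and thus by the triangle inequality

(28)  $\tilde{L}_{\phi,\phi_t}(A'_0) - \tilde{L}_{\phi,\phi_t}(A_0) \leq |\tilde{L}_{\phi,\phi_t}(A_0) - \tilde{L}_{\phi',\phi'_t}(A_0)| + |\tilde{L}_{\phi,\phi_t}(A'_0) - \tilde{L}_{\phi',\phi'_t}(A'_0)|.$

Since

$\tilde{L}_{\phi,\phi_t}(A'_0) - \tilde{L}_{\phi,\phi_t}(A_0) = \frac{1}{2} \int_{\mathbb{R}^3} |\nabla_x h|^2 + |h|^2 |\phi|^2 \geq \frac{1}{2} \|h\|^2_{\dot{H}^1_x}$

we thus see that it suffices to show that

(29)  $|\tilde{L}_{\phi,\phi_t}(A_0) - \tilde{L}_{\phi',\phi'_t}(A_0)| \lesssim M^5 \|\phi[t] - \phi'[t]\|_{H^s_x \times H^{s-1}_x}$

and similarly for $A_0$ replaced by $A'_0$. Using the definition of $\tilde{L}$, we may estimate
the left-hand side of (29) by

$\lesssim \int |A_0| \, |\phi \overline{\phi_t} - \phi' \overline{\phi'_t}| + |A_0|^2 \, \big| |\phi|^2 - |\phi'|^2 \big|.$

Splitting $\phi \overline{\phi_t} - \phi' \overline{\phi'_t}$ as a sum of two differences and using (24) and Lemma 2.1 we
obtain

$\|\phi \overline{\phi_t} - \phi' \overline{\phi'_t}\|_{\dot{H}^{-1}_x} \lesssim M \|\phi[t] - \phi'[t]\|_{H^s_x \times H^{s-1}_x}.$

Also, from (26) and Sobolev we have

$\| |A_0|^2 \|_{L^3_x} \lesssim \|A_0\|^2_{L^6_x} \lesssim \|A_0\|^2_{\dot{H}^1_x} \lesssim M^4$

and from Sobolev we have

$\| |\phi|^2 - |\phi'|^2 \|_{L^{3/2}_x} \lesssim (\|\phi\|_{L^3_x} + \|\phi'\|_{L^3_x}) \|\phi - \phi'\|_{L^3_x} \lesssim M \|\phi - \phi'\|_{H^s_x}$

and the claim (29) follows. This yields the desired bound (27) for $h$. The
analogous claim for $h_t$ follows from (20), Hölder, Sobolev, and Lemma 2.1 as before.
This gives (27) and hence (23) as desired. □

^8 The expression (25) can also be interpreted as the component of the Hamiltonian (13) which
depends on $A_0$.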

Remark 3.3. Heuristically, Proposition 3.1 allows us to eliminate $A_0$ from (MKG-CG),
and think of this system as an evolution purely in $\underline{\Phi}$. Indeed from the above
analysis one morally has $A_0 \approx \Delta^{-1}(\underline{\Phi} \nabla_{x,t} \underline{\Phi})$. However we shall keep $A_0$ explicit in
our computations.

Remark 3.4. Proposition 3.1 asserts that $A_0$ is somewhat smoother than $\underline{\Phi}$: it is
in $\dot{H}^1_x$ even though $\underline{\Phi}$ is merely in $H^s_x$. However we cannot place $A_0$ in $H^1_x$, or even
in $L^2_x$, because of the slow decay of $A_0$ at infinity mentioned earlier.

Remark 3.5. Proposition 3.1 implies that rough initial data $\Phi[0] \in [H^s]$ (see (11))
obeying (8) can be approximated arbitrarily closely in $[H^s]$ norm by smooth initial
data $\Phi'[0]$, also obeying (8). We sketch the argument as follows. Given $\Phi[0] \in [H^s]$
we can first approximate $\underline{\Phi}[0]$ in $H^s_x \times H^{s-1}_x$ norm by a smooth $\underline{\Phi}'[0]$ which still
obeys the divergence-free condition $\nabla_x \cdot \underline{A}'[0] = 0$. Then we construct $A'_0(0)$ as
above, and $\partial_t A'_0(0)$ by (8). From (23) we see that $\nabla_{x,t} A'_0(0)$ is close to $\nabla_{x,t} A_0(0)$
in $L^2_x$, and hence in $H^{s-1}_x$. Combining all these estimates we thus see that $\Phi'[0]$ is
close to $\Phi[0]$ in $[H^s]$ norm as desired.

4. Fixed-time Hamiltonian estimates 

Using the elliptic theory for $A_0$ and the machinery of frequency-localized spaces,
we are now ready to understand the Hamiltonian (13).

From (13) and the triangle inequality we have

$H[\Phi[t]] \lesssim \|\nabla_x A_0(t)\|^2_{L^2_x} + \|\nabla_{x,t} \underline{A}(t)\|^2_{L^2_x} + \|\nabla_{x,t} \phi(t)\|^2_{L^2_x} + \|A_0(t)\phi(t)\|^2_{L^2_x} + \|\underline{A}(t)\phi(t)\|^2_{L^2_x}.$

From Hölder's inequality and the Sobolev embedding $\dot{H}^1_x \subseteq L^6_x$ we thus have

(30)  $H[\Phi[t]] \lesssim (\|\nabla_x A_0(t)\|^2_{L^2_x} + \|\nabla_{x,t} \underline{\Phi}(t)\|^2_{L^2_x})(1 + \|\phi(t)\|^2_{H^1_x}).$

This allows us to control the Hamiltonian by the $[H^1]$ norm.
Now we look at the converse: given that the Hamiltonian is finite, what bounds
can we place on $\Phi$? This question was studied in [24], however in that paper some
$L^2$ control on $\phi$ and $\underline{A}$ was also assumed at time zero. We will not be able to use
such control as the $L^2$ norm is supercritical and so will behave badly with respect
to a rescaling argument which we will use later. Fortunately, we can still obtain
good control on $\Phi$ without the $L^2$ norm, although some odd things happen at low
frequencies.

Lemma 4.1 (Hamiltonian controls $\dot{H}^1$). Let $t \in \mathbb{R}$ be fixed. Suppose that $A_0(t)$,
$\underline{\Phi}[t] = (\underline{A}[t], \phi[t])$ are $[H^1]$ functions such that $\nabla_x \cdot \underline{A}[t] = 0$ and $H[\Phi[t]] \leq 1$. Then
we have the estimates

(31)  $\Phi(t) \in CB(H^1_x + L^6_1)$

(32)  $\nabla_{x,t} \underline{\Phi}(t) \in CB(L^2_x + L^3_2)$

(33)  $\nabla_x A_0(t), \nabla_{x,t} \underline{A}(t) \in CB(L^2_x).$

Informally, control of the Hamiltonian allows one to place most of $A_0(t)$ and $\underline{\Phi}[t]$
in $\dot{H}^1_x \times L^2_x$, except for the low frequency component, which is only in $L^6_x$ or $L^3_x$.
The hypothesis that $A_0$, $\underline{\Phi}$ are $[H^1]$ functions is a purely qualitative hypothesis; the
constants $C$ do not depend on the $[H^1]$ norms of these functions.

Proof. From the hypothesis and (13) we have

(34)  $\nabla_x A_0(t) - \partial_t \underline{A}(t) \in CB(L^2_x)$

(35)  $\nabla_x \times \underline{A}(t) \in CB(L^2_x)$

(36)  $D_0 \phi(t) \in CB(L^2_x)$

(37)  $\underline{D} \phi(t) \in CB(L^2_x).$

From (35) and the hypothesis $\nabla_x \cdot \underline{A}(t) = 0$ we have $\underline{A}(t) \in CB(\dot{H}^1_x)$. Also, by taking
divergence-free and curl-free components of (34) using the hypothesis
$\nabla_x \cdot \partial_t \underline{A}(t) = 0$ we see that

$\nabla_x A_0(t), \partial_t A_j(t) \in CB(L^2_x).$

Combining these estimates together we obtain (33). Using the embedding
$\dot{H}^1_x \subseteq H^1_x + L^6_1$ which comes from applying Sobolev embedding to the low frequencies of
$\dot{H}^1_x$, we thus see that $A_0$ and $\underline{A}$ satisfy the required estimates (31), (32).

It remains to show the corresponding estimates for $\phi$. We begin with the pointwise
identity

$2|\phi(t)| \, \partial_j |\phi(t)| = \partial_j (|\phi(t)|^2) = 2\,\mathrm{Re}(\phi(t) \overline{\partial_j \phi(t)}) = 2\,\mathrm{Re}(\phi(t) \overline{D_j \phi(t)})$

for $j = 1, 2, 3$. In particular we have the "diamagnetic inequality"

$|\partial_j |\phi(t)|| \leq |D_j \phi(t)|.$

^9 One might consider adding a mass term to (MKG-CG) and posing the same global
well-posedness questions. It seems likely that one has similar results for the massive (MKG-CG),
however the argument would be technically more complicated due to the Schrödinger-like
behaviour of low frequencies. Also, the mass term is still supercritical and so this does not solve the
difficulties of using the $L^2$ norm.
From (37), the Sobolev embedding $\dot{H}^1_x \subseteq L^6_x$, and the trivial observation that $|\phi(t)|$
and $\phi(t)$ have the same $L^6_x$ norm we thus have

(38)  $\phi(t) \in CB(L^6_x).$

Also, from our estimates on the $A_j$ and $A_0$ and Sobolev embedding we have

(39)  $A_0(t), A_j(t) \in CB(L^6_x).$

By Hölder we thus have

$A_0(t)\phi(t), A_j(t)\phi(t) \in CB(L^3_x).$

Combining this with (36), (37) we obtain

(40)  $\partial_t \phi(t), \partial_j \phi(t) \in CB(L^2_x + L^3_x).$

On the other hand, if we take the divergence of (37) using the hypothesis
$\nabla_x \cdot \underline{A}[t] = 0$ we obtain

$\Delta \phi(t) + iA_j(t) \partial_j \phi(t) \in CB(\dot{H}^{-1}_x).$

From (39), (40) and Hölder we have

$iA_j(t) \partial_j \phi(t) \in CB(L^{3/2}_x + L^2_x).$

From Sobolev we have $L^{3/2}_x, L^2_x, \dot{H}^{-1}_x \subseteq H^{-1}_x$. From this and the previous we thus
have

$\Delta \phi(t) \in CB(H^{-1}_x).$

We now divide $\phi(t)$ smoothly into a low frequency component supported on $|\xi| \leq 1$,
and the remainder supported on $|\xi| \geq 1/2$. From (38) and the above equation we
see that the low frequency part is in $L^6_1$ and the remainder is in $H^1_x$, so $\phi(t)$ obeys
(31).

It only remains to show that $\partial_t \phi(t)$ obeys (32). From (36) it suffices to show that

$A_0(t)\phi(t) \in CB(L^2_x + L^3_2).$

Since we have already shown that $A_0(t)$, $\phi(t)$ obey (31), the claim then follows from
(14), (18), and (17). □



5. Global well-posedness: preliminary reduction 



We now begin the proof of Theorem 1.8. Fix $\sqrt{3}/2 < s < 1$, and fix the initial data
$\Phi[0]$ obeying the hypotheses of the theorem. Let $T^*$ denote the maximal time of
existence for which one can construct a solution $\Phi$ in $[H^s]$; our objective is to show
that $T^*$ is infinite.

In Section 9 we shall prove the following local well-posedness result (essentially due
to Cuccagna [10]):

Theorem 5.1 (Local well-posedness). Let $5/6 < s < 1$ and $M > 0$. Let $\Phi[0] \in
MB([H^s])$ obey the compatibility conditions (5). Then there exists a solution $\Phi$
to (MKG-CG) with initial data $\Phi[0]$ on the time interval $[-T,T]$ for some $T =
T(M) > 0$. Furthermore, for each time $t \in [-T,T]$ the solution map $\Phi[0] \mapsto \Phi[t]$ is
a Lipschitz map from $MB([H^s])$ to $[H^s]$ (with the Lipschitz constant depending on
$s$ and $M$).
Remark 5.2. In view of the work of Cuccagna [10], the local existence theorem here
should in fact extend to the range $s > 3/4$, and a possibly weakened version of this
local existence theorem should also hold in the range $s > 1/2$ thanks to the work
of Machedon and Sterbenz [30]. However, to avoid technicalities we will restrict
ourselves to the case $s > 5/6$ (which covers the range $s > \sqrt{3}/2$ that our main
theorem covers).

Assume this theorem for the moment. Then we have $T^* > 0$. Furthermore, if $T^*$
is finite, then Theorem 5.1 forces one to have $\lim_{t \to T^*} \|\Phi[t]\|_{[H^s]} = +\infty$. Thus if
we can prove the polynomial growth bound (12) for $t < T^*$, we will have obtained
global well-posedness.

By another application of Theorem 5.1 and a standard limiting argument (using
Remark 3.5) we may assume that $\Phi[0]$ is smooth and $[H^1]$, in which case we have
a global smooth and $[H^1]$ solution from the results in the introduction ([11], [24]).
Thus it will suffice to prove (12) for global smooth solutions.

Henceforth our constants $C$ are allowed to depend on $\|\Phi[0]\|_{[H^s]}$, thus for instance

$\|\Phi[0]\|_{[H^s]} \lesssim 1.$

Fix the time $T$ in (12). In view of Theorem 5.1 we may assume $T \ge 1$. As is usual
in applications of the $I$-method, we will need to rescale the equation using (10),
replacing $\Phi$ by the rescaled solution $\Phi^{(\lambda)}$
for some large $\lambda = \lambda(T) \gg 1$ to be chosen later. Note that $\Phi^{(\lambda)}$ also solves
(MKG-CG). In order to obtain (12) at time $T$ we will need to control $\Phi^{(\lambda)}$ at time $\lambda T$.

We would like to use the Hamiltonian $H[\Phi^{(\lambda)}[t]]$ defined in (13). Unfortunately we
do not have enough regularity on $\Phi$ to ensure this Hamiltonian is finite since $s < 1$.
(On the other hand, $A_0$ has enough regularity thanks to (22).) To get around this
difficulty we shall use the method of almost conservation laws.

We pick a large number $N = N(T) \gg 1$ to be specified later. Let $m(\xi)$ be a smooth
radial positive symbol such that $m(\xi) = 1$ for $|\xi| \le N$ and $m(\xi) = |\xi|^{s-1}/N^{s-1}$ for
$|\xi| \ge 2N$, and let $I$ be the Fourier multiplier

$\widehat{If}(\xi) := m(\xi) \hat f(\xi).$

Thus $I$ is the identity for bounded frequencies $|\xi| \ll N$ and is smoothing of
order $1-s$ for high frequencies $|\xi| \gtrsim N$. Observe that the convolution kernel
of $I$ is integrable, thus $I$ is bounded on every translation-invariant Banach space.
Furthermore, we have the smoothing estimates

(41) $\|u\|_{H^s_x} \lesssim \|Iu\|_{H^1_x} \lesssim N^{1-s} \|u\|_{H^s_x}$



We have two frequency cutoffs in our argument, one at 1 and one at $N$. To avoid confusion
as to what "low" and "high" frequency are, we refer to frequencies $|\xi| \lesssim 1$ as low, frequencies
$1 \ll |\xi| \ll N$ as medium, and frequencies $|\xi| \gtrsim N$ as high. We will also refer to frequencies
$|\xi| \ll N$ as bounded, and frequencies $|\xi| \gtrsim 1$ as local.
(42) $\|Iu\|_{\dot H^1_x} \lesssim N^{1-s} \|u\|_{\dot H^s_x}.$
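
The symbol bounds behind (41) can be checked directly. The following is only a sketch, under the added assumption (not stated above) that $m$ is comparable to 1 on the transition region $N \le |\xi| \le 2N$; the bracket notation $\langle \xi \rangle := (1+|\xi|^2)^{1/2}$ is likewise assumed here.

```latex
% Pointwise comparison of the symbols, for N >= 1 and s < 1:
%   |xi| <= 2N :  m(xi) ~ 1 and <xi>^{1-s} <~ N^{1-s}, so
%                 <xi>^s <= <xi> = <xi>^{1-s} <xi>^s <~ N^{1-s} <xi>^s
%   |xi| >= 2N :  |xi| ~ <xi>, so
%                 m(xi) <xi> ~ N^{1-s} |xi|^{s-1} <xi> ~ N^{1-s} <xi>^s >= <xi>^s
\langle \xi \rangle^{s} \;\lesssim\; m(\xi)\,\langle \xi \rangle \;\lesssim\; N^{1-s}\,\langle \xi \rangle^{s}
\qquad \text{for all } \xi \in \mathbf{R}^3 .
% Multiplying by |\hat u(\xi)|, squaring and integrating (Plancherel) gives
\|u\|_{H^s_x} \;\lesssim\; \|Iu\|_{H^1_x} \;\lesssim\; N^{1-s}\,\|u\|_{H^s_x}.
```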

We will use $H[I\Phi^{(\lambda)}[t]]$ as a substitute for the Hamiltonian. Unfortunately, the loss
of $N^{1-s}$ in (41) would make the modified Hamiltonian large. However, the scaling
parameter $\lambda$ (combined with the fact that the energy regularity $H^1_x$ is sub-critical)
can be used to rescale the Hamiltonian to be small again. More precisely, we have

Lemma 5.3 (Rescaled Hamiltonian is small). Suppose we choose $N \gg 1$, $\lambda \gg 1$
so that

(43) $\lambda^{1/2-s} N^{1-s} \ll 1.$
Then we have $H[I\Phi^{(\lambda)}[0]] \le 1$.



Proof. Observe that 

n^ (A) (o)iUi < ii0 (a) (o)Hl3 = wmhi < wmwm < 1 

so by (00]) it will suffice to show that \\V X IA ( X) (0)\\ L 2 < 1 and ||V X)t /* (A) (0)||i2 < 
1. The former estimate is easy, in fact by (|42l) . (1221) we have 



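For the latter estimate we use (42):

$\|\nabla_{x,t} I \Phi^{(\lambda)}(0)\|_{L^2_x} \lesssim N^{1-s} \|\nabla_{x,t} \Phi^{(\lambda)}(0)\|_{\dot H^{s-1}_x} \lesssim N^{1-s} \lambda^{1/2-s} \|\nabla_{x,t} \Phi(0)\|_{\dot H^{s-1}_x}$

and the claim follows from (43) since $\|\Phi[0]\|_{[H^s]} \lesssim 1$. □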
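
The power $\lambda^{1/2-s}$ in the last display comes from a routine scaling computation. As a sketch, assume that the rescaling (10) has the usual form $\Phi^{(\lambda)}(t,x) = \lambda^{-1}\Phi(t/\lambda, x/\lambda)$; this formula is not visible in the present section and is an assumption here, as is the auxiliary exponent $\gamma$.

```latex
% For f_\lambda(x) := f(x/\lambda) on R^3 one has \hat f_\lambda(\xi) = \lambda^3 \hat f(\lambda\xi), hence
\|f_\lambda\|_{\dot H^{\gamma}_x}^2
  = \int |\xi|^{2\gamma}\, \lambda^{6}\, |\hat f(\lambda\xi)|^2 \, d\xi
  = \lambda^{3-2\gamma}\, \|f\|_{\dot H^{\gamma}_x}^2 .
% Each derivative of \Phi^{(\lambda)} costs an extra factor \lambda^{-1}:
\nabla_{x,t}\Phi^{(\lambda)}(0,x) = \lambda^{-2}\, (\nabla_{x,t}\Phi)(0, x/\lambda),
% so with \gamma = s-1:
\|\nabla_{x,t}\Phi^{(\lambda)}(0)\|_{\dot H^{s-1}_x}
  = \lambda^{-2}\,\lambda^{\frac32-(s-1)}\,\|\nabla_{x,t}\Phi(0)\|_{\dot H^{s-1}_x}
  = \lambda^{\frac12-s}\,\|\nabla_{x,t}\Phi(0)\|_{\dot H^{s-1}_x}.
```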

Thus we can make the modified Hamiltonian small at time zero. In order to control 
the modified Hamiltonian at later times we use the following almost conservation 
law for the modified Hamiltonian: 

Proposition 5.4 (Almost conservation law). Let $(A_0, \Phi)$ be a global smooth solution
to (MKG-CG), and suppose $t_0$ is a time such that

(44) $H[I\Phi[t_0]] \le 2.$

Then we have

(45) $H[I\Phi[t]] = H[I\Phi[t_0]] + O\left(\frac{1}{N^{(s-1/2)-}}\right)$

for all $t \in [t_0 - \delta/2, t_0 + \delta/2]$, where $N^{0-} \lesssim \delta \lesssim 1$ is a small constant depending
only on $s$ and $N$.

Remark 5.5. The error of $O(1/N^{(s-1/2)-})$ corresponds to the restriction $s > \sqrt{3}/2$,
but is not optimal. In particular it seems feasible that one could improve this error
to $O(N^{-1/2+})$, which would in principle allow us to obtain global well-posedness
for $s > 5/6$. For the equation (NLW), an error of $O(N^{-1+})$ is attainable, which
corresponds to the regularity $s > 3/4$ (cf. [17]). By combining this conservation
law with additional techniques, the global well-posedness of (NLW) was extended
to the range $s > 13/18$ in [31], with a further gain to $s > 7/10$ in [32] (in the
spherically symmetric case).
The proof of Proposition 5.4 is rather lengthy and (together with Theorem 5.1)
will occupy the later sections of the paper. For now, we see how this proposition implies (12) and
hence Theorem 1.8.

From Lemma 5.3 and $O(N^{(s-1/2)-})$ applications of Proposition 5.4 we can obtain
the estimate

(46) $\sup_{0 \le t \le T} H[I\Phi^{(\lambda)}[\lambda t]] \le 2$

provided that

(47) $\lambda T \ll N^{(s-1/2)-}.$

A little algebra shows that we can choose $\lambda \gg 1$, $N \gg 1$ so that (47) and (43)
simultaneously hold, so long as $s > \sqrt{3}/2$. Furthermore, both $\lambda$ and $N$ are at most
polynomial in $T$.
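
The "little algebra" can be made explicit. The sketch below writes $\alpha$ for the exponent in the error term of the almost conservation law (so $\alpha = s - 1/2$ in Proposition 5.4); the symbol $\alpha$ and the small parameter $\varepsilon$ are introduced here only for illustration. The last two cases assume that the analogue of (43) is unchanged when the error exponent is improved.

```latex
% (43) forces \lambda to be large compared with a power of N (recall s > 1/2):
\lambda^{1/2-s} N^{1-s} \ll 1 \iff \lambda \gg N^{\frac{1-s}{s-1/2}} .
% (47), written with a general error exponent \alpha, bounds \lambda from above:
\lambda T \ll N^{\alpha-} .
% Both can hold, with N a large power of T, exactly when the lower exponent is the smaller one:
\frac{1-s}{s-\frac12} < \alpha .
% Case \alpha = s - 1/2 (Proposition 5.4):
(s-\tfrac12)^2 > 1-s \iff s^2 > \tfrac34 \iff s > \tfrac{\sqrt3}{2} .
% Case \alpha = 1/2:  1-s < \tfrac12 (s-\tfrac12) \iff s > \tfrac56 .
% Case \alpha = 1:    1-s < s-\tfrac12            \iff s > \tfrac34 .
% With \lambda := N^{\frac{1-s}{s-1/2}+\varepsilon}, condition (47) reads
% T \ll N^{\alpha - \frac{1-s}{s-1/2} - \varepsilon -}, so N, and hence \lambda, is a power of T.
```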

To finish up we use an integration in time argument inspired by a similar argument
from [19]. From (46) and Lemma 4.1 we have that

(48) $\nabla_{x,t} I\Phi^{(\lambda)}(t) \in CB(L^2_x + L^3_x)$

for all $0 \le t \le T$ (note that $IA^{(\lambda)}$ is automatically divergence free). Also, from
(41) we have

(49) $I\Phi^{(\lambda)}(0) \in CT^C B(H^1_x) \subset CT^C B(L^2_x).$
Using the fundamental theorem of calculus

$I\Phi^{(\lambda)}(t) = I\Phi^{(\lambda)}(0) + \int_0^t \partial_t I\Phi^{(\lambda)}(t')\, dt'$

we thus see that

$I\Phi^{(\lambda)}(t) \in CT^C B(L^2_x + L^3_x).$

Splitting $ smooth into low frequencies |£| < 4 and a remainder term |£| > 2 and 
using (l4"51) we obtain 

J$ (A) (i) G CT c B(Hl +h\). 
On the other hand, from Lemma 14.11 we have 

A^\t)eCB{Hl+h\). 
Multiplying the two using the Sobolev embedding H\ C £ 4 , we obtain 

4 A) (t)/$ (A) (<) g ct c b(l2). 

But from (gH]), (jTHl) we have 

d t I^ x \t) + iAi x) (t)I4>W{t) G CT C B(L2) 

and hence 

d t I(f> w (t) £ CT C B{L 2 X ). 
Combining this with (|33[) we obtain 

d t I$ W (t) G CT C B(L2). 
Applying the fundamental theorem of calculus and (|49l) again we obtain 

I$( A )(AT) G CT C B(L2). 
Undoing the scaling we thus obtain (12) as desired. This proves Theorem 1.8.

It remains to show Theorem 5.1 and Proposition 5.4. This will occupy the remainder
of the paper.



6. A caricature for MKG-CG

The system (MKG-CG) may appear excessively complicated, due to vector structures,
Riesz transforms, complex conjugates, and constants such as $2i$. To clean up
some of the clutter we shall adopt some notational conventions to reduce (MKG-CG)
to a "caricature" form, which we will then use to prove both Theorem 5.1 and
Proposition 5.4.

We adopt the convention that if $A$ is a scalar-, vector-, or tensor-valued quantity,
then $\mathcal{O}(A)$ denotes an expression which is schematically of the form $A$, or more
precisely a finite linear combination of expressions of the form $T_i \mathrm{Re}(A_i)$ and $T'_i \mathrm{Im}(A_i)$,
where $A_i$ denotes the various components of $A$ and $T_i$, $T'_i$ either denote constants
or Riesz transforms (which arise due to the presence of the Leray projection $P$).
We recall the well-known fact from Calderón-Zygmund theory (see e.g. [35]) that
these operators are bounded on $L^p_x$ for every $1 < p < \infty$. We can then define
quadratic schematic expressions $\mathcal{O}(AB)$ and cubic ones $\mathcal{O}(ABC)$ by using the
convention that $AB$ denotes the tensor product of $A$ and $B$ (viewed as real tensors
rather than complex, thus for instance $\mathrm{Re}(A)\mathrm{Im}(B) = \mathcal{O}(AB)$), etc. For example,
we have

Im(<f>D <f>) 



0(0&$+0(4)#) 
0(tVx,tt) + 0($$$) 

0(0V0) 

0(tVx,tt) 
0($$$) 

0(A o d t $) 
0((d t A o m 



(1 - P)(Im<£V x <£) 



(l-P)(A\^),P(A\^),\A\ 2 ^\Ao\ 2 ^ 

2iA d t (j} 
i{dtAo)(f) 



and we can therefore rewrite (MKG-CG) in the caricature form

(50) $\nabla_x \nabla_{x,t} A_0 = \mathcal{O}(N_2) + \mathcal{O}(N_3)$
     $\Box \Phi = \mathcal{O}(N_0) + \mathcal{O}(N_1) + \mathcal{O}(N_3)$
where the bilinear and trilinear nonlinearities $N_0$, $N_1$, $N_2$, $N_3$ are defined as the
tensors

(51) $N_0 := (P\,\mathrm{Im}(\phi \overline{\nabla_x \phi}),\ P(A) \cdot \nabla_x \phi)$

(52) $N_1 := ((\partial_t A_0)\phi,\ A_0 \partial_t \phi)$

(53) $N_2 := \Phi \nabla_{x,t} \Phi$

(54) $N_3 := \Phi\Phi\Phi$



Remark 6.1. The cubic nonlinearity $N_3$ is relatively easy to deal with. The
nonlinearity $N_2$ would be dangerous if it were present in the "hyperbolic" equation for
$\Box\Phi$, but fortunately only affects the "elliptic" equation for $A_0$, which has better
smoothing effects with which to handle this nonlinearity. The nonlinearity $N_1$ is
tractable due to the high regularity of $A_0$. The null form $N_0 = N_0(\Phi, \Phi)$ is perhaps
the most interesting. It is a special case of the more general quadratic form $N_2$, or
more precisely

(55) $N_0 = \mathcal{O}(N_2).$
However, one can express $N_0$ more carefully as

(56) $N_0(\Phi, \Psi) = \mathcal{O}(|\nabla_x|^{-1} Q(\Phi, \Psi)) + \mathcal{O}(Q(|\nabla_x|^{-1}\Phi, \Psi))$

where $Q$ is the null form $Q(\phi, \psi) := \nabla_x \phi \wedge \nabla_x \psi$, which can be expressed in
components as

$Q_{jk}(\phi, \psi) := \partial_j \phi\, \partial_k \psi - \partial_k \phi\, \partial_j \psi.$

See [24] for more details.

Remark 6.2. The equation

$\Box\Phi = \mathcal{O}(N_0(\Phi, \Phi))$

is sometimes used as a simplified model for (MKG-CG) (and also for Yang-Mills
equations in the Coulomb gauge); see e.g. [25], [27]. However we will not use this
model equation here.



7. Function spaces 

We now recall some notation for the function spaces we shall use to control the
nonlinear expressions $N_0$, $N_1$, $N_2$, $N_3$ properly, which will be useful both for proving
Theorem 5.1 and Proposition 5.4.

Given a spacetime function $\phi : \mathbf{R} \times \mathbf{R}^3 \to \mathbf{C}$, we use $\hat\phi$ to denote the spacetime
Fourier transform

$\hat\phi(\xi, \tau) := \int_{\mathbf{R} \times \mathbf{R}^3} e^{-i(x \cdot \xi + t\tau)} \phi(x,t)\, dx\, dt.$

Of course, the spacetime Fourier transform only makes sense if $\phi$ is defined globally
on $\mathbf{R} \times \mathbf{R}^3$ (as opposed to a spacetime slab such as $[0,T] \times \mathbf{R}^3$). In practice
this difficulty is avoided by using the spacetime Fourier transform to define global
function spaces, and then defining their local counterparts by restriction.
If $X$ is a Banach space of functions on $\mathbf{R}^3$, we use $L^q_t X$ to denote the space of
functions whose norm

$\|u\|_{L^q_t X} := \left( \int_{\mathbf{R}} \|u(t)\|_X^q \, dt \right)^{1/q}$

is finite, with the usual modifications when $q = \infty$; we also let $C^0_t X$ be the space of
bounded continuous functions from $\mathbf{R}$ to $X$ with the supremum norm. In particular,
we have the mixed Lebesgue spaces $L^q_t L^r_x$ and the energy spaces $C^0_t H^s_x \cap C^1_t H^{s-1}_x$.
These spaces localise to spacetime slabs $I \times \mathbf{R}^3$ in the obvious manner.

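For any $s, b \in \mathbf{R}$, we denote by $H^{s,b} = H^{s,b}(\mathbf{R} \times \mathbf{R}^3)$ the space of spacetime functions
on $\mathbf{R} \times \mathbf{R}^3$ whose norm

$\|u\|_{H^{s,b}} := \| \langle \xi \rangle^s \langle |\xi| - |\tau| \rangle^b \hat u \|_{L^2_\tau L^2_\xi}$

is finite. We observe the trivial inclusions $H^{s_2,b_2} \subset H^{s_1,b_1}$ whenever $s_2 \ge s_1$ and
$b_2 \ge b_1$.

If $I$ is a bounded time interval, we define the restricted space $H^{s,b}_I = H^{s,b}(I \times \mathbf{R}^3)$
to consist of the restriction of $H^{s,b}$ functions to the spacetime slab $I \times \mathbf{R}^3$, with
norm

$\|u\|_{H^{s,b}_I} := \inf\{ \|v\|_{H^{s,b}} : v \in H^{s,b},\ v|_{I \times \mathbf{R}^3} = u \}.$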

We now formalize the well-known fact that $H^{s,1/2+}$ functions are "averages" of free
$H^s_x$ solutions to the wave equation (see e.g. [33, Proposition 7] or [38, Lemma 2.9]).

Lemma 7.1 ($H^{s,b}$ decomposes into free solutions). Let $\phi \in H^{s,b}$ for some $b > 1/2$.
Then for each $\lambda \in \mathbf{R}$ there exists a global solution $\phi_\lambda$ to the free wave equation
$\Box\phi_\lambda = 0$ with $\|\phi_\lambda[t]\|_{H^s_x \times H^{s-1}_x} \lesssim 1$ for all $t$, and a co-efficient $a(\lambda) \in \mathbf{R}$, such that

$\phi(t) = \int a(\lambda) e^{it\lambda} \phi_\lambda(t)\, d\lambda$

for all $t$, and such that

$\|a\|_{L^1_\lambda} \lesssim \|\langle \lambda \rangle^b a\|_{L^2_\lambda} \lesssim \|\phi\|_{H^{s,b}}$
where the implicit constant can depend on $b$.



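Proof. Without loss of generality we may assume that the spacetime Fourier transform
$\hat\phi$ is supported on the upper half-space $\{(\tau, \xi) : \tau > 0\}$. We then write

$\hat\phi(\tau, \xi) = \int_{\mathbf{R}} \hat\phi(|\xi| + \lambda, \xi)\, \delta(\tau - \lambda - |\xi|)\, d\lambda$

where $\delta$ is the Dirac delta. If we then define

$a(\lambda) := \|\langle \xi \rangle^s \hat\phi(|\xi| + \lambda, \xi)\|_{L^2_\xi}$

and

$\hat\phi_\lambda(\tau, \xi) := \frac{1}{a(\lambda)}\, \delta(\tau - |\xi|)\, \hat\phi(|\xi| + \lambda, \xi)$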



These spaces are also known as $X^{s,b}$ spaces in the literature.
we see that all the relevant properties are easily verified except perhaps for the $L^1_\lambda$
bound on $a$, which we compute using Cauchy-Schwarz:

$\|a\|_{L^1_\lambda} \lesssim \|\langle \lambda \rangle^b a\|_{L^2_\lambda} = \|\langle \tau - |\xi| \rangle^b \langle \xi \rangle^s \hat\phi\|_{L^2_\tau L^2_\xi} = \|\phi\|_{H^{s,b}}.$

□ 
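
The Cauchy-Schwarz step in the proof above is where the hypothesis $b > 1/2$ enters. Spelled out (a sketch, with $\langle\lambda\rangle := (1+|\lambda|^2)^{1/2}$ assumed as the meaning of the bracket):

```latex
\|a\|_{L^1_\lambda}
  = \int_{\mathbf R} \langle\lambda\rangle^{-b}\,\langle\lambda\rangle^{b}\,|a(\lambda)|\,d\lambda
  \le \Big(\int_{\mathbf R} \langle\lambda\rangle^{-2b}\,d\lambda\Big)^{1/2}
      \|\langle\lambda\rangle^{b} a\|_{L^2_\lambda},
\qquad
\int_{\mathbf R} \langle\lambda\rangle^{-2b}\,d\lambda < \infty \iff 2b > 1 .
```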



As a particular consequence of this lemma, we see that if one can imbed $H^s_x \times
H^{s-1}_x$ free solutions in a spacetime Banach space $X$ which is invariant under time
modulations $\phi(t) \mapsto \phi(t) e^{it\lambda}$, then one can also imbed $H^{s,1/2+}$ solutions into the
same space. In particular we have

(57) $H^{s,1/2+} \subset C^0_t H^s_x$

for any $s \in \mathbf{R}$. Also, from Strichartz' estimate (see e.g. [15], [36], and the references
therein) and Lemma 7.1 we have

(58) $H^{s,1/2+} \subset L^q_t L^r_x$
whenever 

a > 

I + I < I 

(59) s r 7 2 
v ' 2 < q <oo 

q r — 2 

except at the endpoint (q, r, s) — (2, oo, 0), where the estimate is known to fail (see 
[T^]). If time is localized to an interval, one can also use Holder in time to lower 
the q index. 

Finally, we recall

Lemma 7.2 (Energy estimate). [33, Theorem 12] For any time $t_0$, any interval $I$
of width $O(1)$ containing $t_0$, any $0 \le \sigma \le 1 - b$, $1 > b > 1/2$, and $s \in \mathbf{R}$, we have

(60) $\|u\|_{H^{s,b}_I} + \|\nabla_{x,t} u\|_{H^{s-1,b}_I} \lesssim \|u[t_0]\|_{H^s_x \times H^{s-1}_x} + |I|^{\sigma/2} \|\Box u\|_{H^{s-1,b-1+\sigma}_I}$

whenever the right-hand side is finite. The implied constant of course depends on
$\sigma$, $b$, $s$.

Note the factor of $|I|^{\sigma/2}$ on the right-hand side of (60): this factor will be very
convenient for the large data theory. The fact that $\sigma$ is allowed to be as large as $1 - b$
(rather than 1/2) allows us to reach $s > \sqrt{3}/2 \approx .866$ rather than $s > 7/8 = .875$.



8. Bilinear estimates 



To obtain local well-posedness in $H^s_x$, we shall place $\Phi$ in $H^{s,3/4+}$, $\nabla_{x,t}\Phi$ in $H^{s-1,3/4+}$,
and $\nabla_{x,t} A_0$ in $H^{1/2+,0}$ (compare with [10]).

We are now ready to prove the main bilinear estimates needed to handle the
nonlinear expressions $N_0$, $N_1$, $N_2$.
Proposition 8.1 (Bilinear estimates). Let $3/4 < s < 1$. Then we have the estimates

(61) $\|\eta(t) \phi \nabla_{x,t} \psi\|_{H^{s'+s''-2,0}} \lesssim \|\phi\|_{H^{s',3/4+}} \|\nabla_{x,t} \psi\|_{H^{s''-1,3/4+}}$

(62) $\|\eta(t) (\partial_t A_0) \phi\|_{H^{s-1,s-1}} \lesssim \|\nabla_{x,t} A_0\|_{H^{1/2+,0}} \|\phi\|_{H^{s,3/4+}}$

(63) $\|\eta(t) A_0 (\partial_t \phi)\|_{H^{s-1,s-1}} \lesssim \|\nabla_{x,t} A_0\|_{H^{1/2+,0}} \|\partial_t \phi\|_{H^{s-1,3/4+}}$

(64) $\|\eta(t) N_0(\phi, \psi)\|_{H^{s-1,s-1}} \lesssim \|\phi\|_{H^{s,3/4+}} \|\psi\|_{H^{s,3/4+}}$

where $\eta$ is any bump function and $s \le s', s'' \le 1$ are exponents which are not both
equal to 1, and $A_0$, $\phi$, $\psi$ are arbitrary functions for which the left-hand side makes
sense. (Of course, the implicit constants depend on $s$, $\eta$, $s'$, $s''$.)

These estimates were essentially proven in [10], but we sketch a proof here based
on the bilinear estimates in [12]. For local existence we need to take $s' = s'' = s$,
but for global existence we will also need one of $s'$, $s''$ to equal 1 instead.

Proof. We first prove (61). From [12, Theorem 1.1] and the assumption $3/4 < s < 1$,
we have

$\|\phi \nabla_x \psi\|_{\dot H^{s'+s''-2,0}} \lesssim \|\phi[0]\|_{\dot H^{s'} \times \dot H^{s'-1}} \|\psi[0]\|_{\dot H^{s''} \times \dot H^{s''-1}}$

for all global solutions $\Box\phi = \Box\psi = 0$ to the wave equation, where $\|u\|_{\dot H^{s,b}} := \| |\xi|^s ||\xi| -
|\tau||^b \hat u \|_{L^2_\tau L^2_\xi}$ is the homogeneous $H^{s,b}$ norm.

Now we prove the variant

(65) $\|\eta(t) \phi \nabla_x \psi\|_{H^{s'+s''-2,0}} \lesssim \|\phi[0]\|_{H^{s'} \times H^{s'-1}} \|\psi[0]\|_{H^{s''} \times H^{s''-1}}.$

If $\phi$ and $\psi$ both have Fourier support on the region $|\xi| \ge 1$ then this follows from the
previous estimate, since $s'+s''-2$ is negative, and the $\dot H^{s'+s''-2,0}$ norm controls the
$H^{s'+s''-2,0}$ norm, and the $\eta(t)$ cutoff is harmless. Now suppose that $\phi$ has Fourier
support in the region $|\xi| \le 1$. Estimating $\nabla_x \psi$ in $C^0_t H^{s''-1}_x$ and $\phi$ in $C^0_t H^{10}_x$ (for
instance) it suffices by a Hölder in time to show the estimate

$\|fg\|_{H^{s'+s''-2}_x} \lesssim \|f\|_{H^{10}_x} \|g\|_{H^{s''-1}_x}.$

But this follows from (H3J). A similar argument holds for ip (in fact here it is easier 
as the derivative is in a more favorable location). This proves (f^5|) . 

Now suppose that $\phi$ is an arbitrary spacetime function, while $\psi$ still solves the wave
equation $\Box\psi = 0$. We claim the estimate

$\|\eta(t) \phi \nabla_x \psi\|_{H^{s'+s''-2,0}} \lesssim \|\phi\|_{H^{s',3/4+}} \|\psi[0]\|_{H^{s''} \times H^{s''-1}}.$

To see this, we use Lemma 7.1 to write $\phi(t) = \int_{\mathbf{R}} a(\lambda) e^{it\lambda} \phi_\lambda(t)\, d\lambda$. From (65),
Minkowski's inequality and the observation that the norm $H^{s'+s''-2,0}$ is invariant
under multiplication by $e^{it\lambda}$, we have

$\|\eta(t) \phi \nabla_x \psi\|_{H^{s'+s''-2,0}} \lesssim \|a\|_{L^1_\lambda} \|\psi[0]\|_{H^{s''} \times H^{s''-1}}$

and the claim follows from Lemma 7.1. If we then apply similar reasoning to $\psi$ we
obtain

(66) $\|\eta(t) \phi \nabla_x \psi\|_{H^{s'+s''-2,0}} \lesssim \|\phi\|_{H^{s',3/4+}} \|\psi\|_{H^{s'',3/4+}}.$

12 More generally, one has $\|\phi\psi\|_{\dot H^{s,b}} \lesssim \|\phi[0]\|_{\dot H^{s'} \times \dot H^{s'-1}} \|\psi[0]\|_{\dot H^{s''} \times \dot H^{s''-1}}$ whenever $s + b =
s' + s'' - 1$, $s' + s'' > 1/2$, $s > -1$, $b \ge 0$, and $s \le \min(s', s'')$, with at least one of the latter two
inequalities being strict. See [12].

To establish (61) in the case where $\psi$ has Fourier support in the region $|\xi| \ge 1$,
one applies (66) with $\psi$ replaced by $\nabla_x \Delta^{-1} \nabla_{t,x} \psi$ and then takes traces. The
final remaining case of (61) to check is when $\psi$ has Fourier support in the region
$|\xi| \le 1$. In that case, the norm $\|\nabla_{t,x}\psi\|_{H^{s''-1,3/4+}}$ controls $\|\nabla_{t,x}\psi\|_{L^\infty_t H^{100}_x}$ (say),
while $\|\phi\|_{H^{s',3/4+}}$ controls $\|\phi\|_{L^\infty_t H^{s'}_x}$, and the claim follows from standard Sobolev
product estimates.

Now we prove (64). From [12, Corollary 13.4] and (56), and the hypothesis $3/4 <
s < 1$, we have

when $\Box\phi = \Box\psi = 0$. We now prove the variant estimate

(67) $\|\eta(t) N_0(\phi, \psi)\|_{H^{s-1,s-1}} \lesssim \|\phi[0]\|_{H^s_x \times H^{s-1}_x} \|\psi[0]\|_{H^s_x \times H^{s-1}_x}.$

Again this estimate follows from the previous when $\phi$ and $\psi$ both have Fourier
support in the region $|\xi| \ge 1$ (note that multiplication by $\eta(t)$ is bounded on any
$H^{s,b}$ space). Now suppose $\phi$ has Fourier support on $|\xi| \le 1$. Crudely writing
$N_0(\phi, \psi) = \mathcal{O}(\phi \nabla_x \psi)$ and estimating the $H^{s-1,s-1}$ norm by the $L^2_t H^{s-1}_x$ norm, we
argue as with (61), the only difference being that $-1/2+$ has been replaced by $s-1$.
Similarly for $\psi$.



Now we invoke Lemma 7.1 again. We have to be a bit more careful because the space
$H^{s-1,s-1}$ is not invariant under multiplication by $e^{it\lambda}$; however, from the (rather
crude) elementary inequality $\langle a + b \rangle^{s-1} \lesssim \langle a \rangle^{1-s} \langle b \rangle^{s-1}$ we do have the estimate

$\|e^{it\lambda} u\|_{H^{s-1,s-1}} \lesssim \langle \lambda \rangle^{1-s} \|u\|_{H^{s-1,s-1}}.$
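
For completeness, here is a sketch of how the elementary inequality yields the modulation estimate. The bracket $\langle x \rangle := (1+|x|^2)^{1/2}$ and the sign in the frequency shift are assumptions of this sketch; the argument is insensitive to either choice.

```latex
% Peetre-type inequality for real numbers a, b:
\langle b \rangle \le \sqrt{2}\,\langle a \rangle \langle a+b \rangle .
% Raise to the power 1-s \in (0,1) and divide:
\langle a+b \rangle^{s-1} \le 2^{\frac{1-s}{2}} \langle a \rangle^{1-s} \langle b \rangle^{s-1} .
% Modulation by e^{it\lambda} translates the \tau variable by \lambda. With
b := |\xi| - |\tau|, \qquad a := |\tau| - |\tau \pm \lambda|, \qquad |a| \le |\lambda|,
% the weight of H^{s-1,s-1} at the shifted point obeys
\langle |\xi| - |\tau \pm \lambda| \rangle^{s-1}
  \lesssim \langle \lambda \rangle^{1-s} \langle |\xi| - |\tau| \rangle^{s-1},
% and inserting this bound into the definition of the H^{s-1,s-1} norm gives the estimate.
```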

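So if $\phi$ does not solve the free wave equation, and we decompose as in Lemma 7.1,
then

$\|\eta(t) N_0(\phi, \psi)\|_{H^{s-1,s-1}} \lesssim \int \langle \lambda \rangle^{1-s} |a(\lambda)|\, \|\eta(t) N_0(\phi_\lambda, \psi)\|_{H^{s-1,s-1}}\, d\lambda.$

Using (67) we thus have

$\|\eta(t) N_0(\phi, \psi)\|_{H^{s-1,s-1}} \lesssim \|\langle \lambda \rangle^{1-s} a\|_{L^1_\lambda} \|\psi[0]\|_{H^s_x \times H^{s-1}_x}.$
However since $s > 3/4$, we have $3/4+ > 1/2 + (1 - s)$, so we can estimate

$\|\langle \lambda \rangle^{1-s} a\|_{L^1_\lambda} \lesssim \|\langle \lambda \rangle^{3/4+} a\|_{L^2_\lambda} \lesssim \|\phi\|_{H^{s,3/4+}}.$

By arguing similarly for $\psi$ we obtain (64).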
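
The exponent count behind the passage from the weighted $L^1_\lambda$ norm to the weighted $L^2_\lambda$ norm can be written out as follows (a sketch, with $3/4+$ written as $3/4+\epsilon$ for a small $\epsilon > 0$ introduced here for illustration):

```latex
% By Cauchy-Schwarz,
\|\langle\lambda\rangle^{1-s} a\|_{L^1_\lambda}
  \le \Big(\int_{\mathbf R} \langle\lambda\rangle^{-2(\frac34+\epsilon-(1-s))}\,d\lambda\Big)^{1/2}
      \|\langle\lambda\rangle^{\frac34+\epsilon} a\|_{L^2_\lambda},
% and the integral converges exactly when
2\big(\tfrac34+\epsilon-(1-s)\big) > 1 \iff s > \tfrac34 - \epsilon ,
% which holds for every \epsilon > 0 once s > 3/4.
```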



13 The main idea of this corollary is to exploit the heuristic $Q(\phi, \psi) \sim
|\nabla_x|^{1/2} D_-^{1/2} (|\nabla_x|^{1/2} \phi)(|\nabla_x|^{1/2} \psi)$, where $D_-$ is the spacetime Fourier multiplier with weight
$||\xi| - |\tau||$; this allows one to reduce the null form estimate to a product estimate similar to the
one used to prove (61). See [12], and also Lemma 15.1 below.
Finally, we prove (62), (63). From the embeddings $H^{s,3/4+} \subset C^0_t H^s_x$, $H^{s-1,3/4+} \subset
C^0_t H^{s-1}_x$ (from (57)) and the trivial embedding $L^2_t H^{s-1}_x \subset H^{s-1,s-1}$, it suffices to
show the spatial product estimates

$\|fg\|_{H^{s-1}_x} \lesssim \|f\|_{H^{1/2+}_x} \|g\|_{H^s_x}$

and

$\|fg\|_{H^{s-1}_x} \lesssim \|\nabla_x f\|_{H^{1/2+}_x} \|g\|_{H^{s-1}_x}.$

The first estimate follows directly from (14). To prove the second we use duality
to convert it to

$\|fh\|_{H^{1-s}_x} \lesssim \|\nabla_x f\|_{H^{1/2+}_x} \|h\|_{H^{1-s}_x}.$

If $f$ has Fourier support in the region $|\xi| \ge 1$ then this again follows from (14),
so we may assume $f$ has support in the region $|\xi| \le 1$. But then this follows
by applying the fractional Leibnitz rule for $\langle \nabla_x \rangle^{1-s}$ and the Sobolev embedding

$\|\langle \nabla_x \rangle^{1-s} f\|_{L^\infty_x} \lesssim \|\nabla_x f\|_{H^{1/2+}_x}.$ □

9. Local existence 

We are now finally ready to prove the local existence result, Theorem 5.1.

Remark 9.1. This result is essentially in [10], but for our application we need
local well-posedness for large data as well as small. One cannot simply rescale
large data to be small because the $L^2_x$ norm is supercritical in (MKG-CG). By
localizing in time and modifying the "b" index of the $H^{s,b}$ norms we can control
the "hyperbolic" component of the large data evolution for short times. However
for the "elliptic" component of the evolution localizing in time does not help. One
might try localizing in space, but this is tricky because the non-local Coulomb gauge
has destroyed finite speed of propagation, and one would probably be forced to use
local Coulomb gauges, cf. [3D] and [35]. Fortunately, the variational estimates in 
Section 3 will allow us to avoid these difficulties.

Proof of Theorem 5.1. Let $T > 0$ be chosen later. Let $\Phi[0]$, $\Phi'[0]$ be two smooth
elements of $MB([H^s])$ obeying (8). Then by the global well-posedness theory in
the introduction we can find smooth solutions $\Phi$, $\Phi'$ to (MKG-CG) on the slab
$[-T,T] \times \mathbf{R}^3$. We shall show the estimates

(68) $\|\Phi - \Phi'\|_{C^0_t [H^s]} \le C(M) \|\Phi[0] - \Phi'[0]\|_{[H^s]}$

on this slab; the Theorem then follows by a standard limiting argument, using the
remarks at the end of Section 3 to approximate rough data by smooth data.

Define the norm $\|\Phi\|_X$ on the slab $[-T,T] \times \mathbf{R}^3$ by

(69) $\|\Phi\|_X := \|\Phi\|_{H^{s,3/4+}_{[-T,T]}} + \|\nabla_{x,t}\Phi\|_{H^{s-1,3/4+}_{[-T,T]}} + \varepsilon \|\nabla_{x,t} A_0\|_{L^2_t H^{1/2+}_x}$

where $0 < \varepsilon = \varepsilon(M) \ll 1$ is a small number to be chosen later. We shall show that

$\|\Phi - \Phi'\|_X \le C(M) \|\Phi[0] - \Phi'[0]\|_{[H^s]}$
which will imply flM]) by (E3) and ©• 
To simplify the exposition we shall just prove the bound

$\|\Phi\|_X \le C(M)$

but the reader may verify that the arguments below can be easily adapted to
differences.

We begin with the $\Phi$ component of the $X$ norm. By (60) and (50) we have

(70) $\|\Phi\|_{H^{s,3/4+}_{[-T,T]}} + \|\nabla_{x,t}\Phi\|_{H^{s-1,3/4+}_{[-T,T]}} \lesssim M + T^{(s-3/4+)/2} \sum_{j=0,1,3} \|N_j\|_{H^{s-1,s-1}_{[-T,T]}}.$

From Proposition 8.1 (restricted to $[-T,T]$ in the obvious fashion) we can control
the $N_0$, $N_1$ nonlinearities:

$\|N_0\|_{H^{s-1,s-1}_{[-T,T]}} + \|N_1\|_{H^{s-1,s-1}_{[-T,T]}} \le C(\varepsilon) \|\Phi\|_X^2.$

On the other hand, from Strichartz (58) we have

$\|\Phi\|_{L^6_t L^6_x} \lesssim \|\Phi\|_X.$

Also, we have by Sobolev and (|22[) 

Po|U|i6 < \\W X A Q \\ C?L 2 

3 

(71) ^ 

Combining these estimates we can control the cubic nonlinearity N3: 

(72) HNallflpw < II* 3 ||l^ < ||*||i- iS < £ II* Hi- 

i=3 

Putting all of this together we obtain

(73) $\|\Phi\|_{H^{s,3/4+}_{[-T,T]}} + \|\nabla_{x,t}\Phi\|_{H^{s-1,3/4+}_{[-T,T]}} \lesssim M + T^{(s-3/4+)/2} C(\varepsilon, \|\Phi\|_X).$

Now we consider the A® component of the X norm. From the computations in (|71j) 
we have 

HV^olkf^ < 11*113: 

so we have 

||V !e , t A)|| 2; j fl - i /»+ < ||*||x + ||V s V x , t A || J .^- 1/ 2+. 
By (l50|) we thus have 

\\V x<t A \\ L2tHlJ2+ < ||*||| + ||N 2 || i?i? - I/2 + + ||N3|| i?H - I/3 +. 



It is here that we crucially make use of the hypothesis $s > 5/6$. It is likely that the methods
of Cuccagna [10] can control the cubic terms $N_3$ by more sophisticated estimates than Strichartz
estimates in the larger range $s > 3/4$, but we will not need to do so here. We thank the referee
for these points.
We estimate the second term using Proposition 8.1 (observing that $s + s - 2 \ge -1/2+$
since $s > 3/4$), and we estimate the third term by the computations in (72), to
obtain

(74) \\v x ,tAo\\ L2tH y 2+ <\m\x + \m\x- 

Combining this with our previous bound for $\Phi$ we obtain

$\|\Phi\|_X \lesssim M + T^{(s-3/4+)/2} C(\varepsilon, \|\Phi\|_X) + \varepsilon C(\|\Phi\|_X).$

If we now choose $\varepsilon = \varepsilon(M)$ sufficiently small, and $T = T(\varepsilon, M)$ sufficiently small,
we thus see that there is an absolute constant $C$ such that

$\|\Phi\|_X \le 2CM \implies \|\Phi\|_X \le CM.$

By standard continuity arguments this implies that $\|\Phi\|_X \le CM$, as desired. The
adaptation of this scheme to differences is routine (using (23) instead of (22)) and
is left to the reader. □
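
One way to organise the choice of $\varepsilon$ and $T$ in the last step is sketched below. The explicit constant $C_0$, the abbreviation $\theta := (s-3/4+)/2 > 0$, and the assumption that the quantities $C(\cdot)$ are nondecreasing in $\|\Phi\|_X$ are introduced here for illustration only.

```latex
% Write the combined bound with an explicit implied constant C_0:
\|\Phi\|_X \le C_0\big( M + T^{\theta}\, C(\varepsilon,\|\Phi\|_X) + \varepsilon\, C(\|\Phi\|_X) \big).
% Set C := 3C_0 and suppose \|\Phi\|_X \le 2CM. Choose, in this order,
\varepsilon = \varepsilon(M) \ \text{with}\ \varepsilon\, C(2CM) \le M,
\qquad
T = T(\varepsilon, M) \ \text{with}\ T^{\theta}\, C(\varepsilon, 2CM) \le M .
% Then each of the three terms is at most M, and
\|\Phi\|_X \le C_0 (M + M + M) = CM .
```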



10. Modified local well-posedness 



To finish the proof of Theorem 1.8 it only remains to prove Proposition 5.4. Fix

A), t, to- 

From a continuity argument we may assume a priori that

(75) $\sup_{t \in [t_0 - \delta/2,\, t_0 + \delta/2]} H[I\Phi[t]] \le 3.$

Our objective is to control the modified Hamiltonian at times $t$ close to $t_0$. In order
to do this we must obtain estimates on $\Phi$ away from $t_0$. One obvious possibility is
to combine (75) and Lemma 4.1; this for instance will give the estimate

(76) $(\partial_t + iIA_0) I\phi \in C^0_t L^2_x$

on the slab $[t_0 - \delta/2, t_0 + \delta/2]$. However these types of estimates (which basically
place $I\Phi$ in $C^0_t [H^1]$) will not be the right type of estimates (except for the low
frequency component) for estimating the change in the Hamiltonian; it turns out
that we need estimates in $H^{s,b}$ spaces, which are not directly controlled by the
Hamiltonian.

One might hope to apply Theorem 5.1, since the regularity (44) should be enough
to put $\Phi$ in $[H^s]$. However this is inefficient (basically because there is a significant
loss in using (41)) and in addition there are some low frequency issues, because
of the error terms in Lemma 4.1. So we shall instead require a modified local
well-posedness result which is adapted to the estimates arising from Lemma 4.1.



^Basically, the problem is that Sobolev embedding in three dimensions does not allow C^H 1 ] 
to control norms, so that nonlinearities such as ^Va;"!' cannot be placed in norms. To get 
around this we must use Strichartz embeddings and null form estimates, which in turn necessitates 
the use of H s,b spaces. We remark that in one dimension one can rely purely on Sobolev embedding 
and obtain global well-posedness results for nonlinear wave equations below H 1 without using H s ' b 
or Strichartz norms; see pp. 
From (44) and Lemma 4.1 we thus have

(77) $\Phi(t_0) \in CB(I^{-1} H^1_x + L^6_x)$

(78) $\nabla_{x,t} \Phi(t_0) \in CB(I^{-1} L^2_x + L^3_x)$

(recall that $I$ is the identity on low frequencies). We now extend this control at
time $t_0$ to control on $[t_0 - \delta, t_0 + \delta]$ with the following proposition, which is the main
result of this section.

Proposition 10.1 (Spacetime control on $\Phi$). Adopt the assumptions of Proposition
5.4 and suppose in addition $K \gg 1$ is a sufficiently large constant. We conclude
there is a $\delta$ with $c(K) N^{0-} \le \delta \le C(K)$, such that

(79) $\nabla_{x,t} \Phi \in KB(I^{-1} H^{0,s-}_{[t_0-\delta,t_0+\delta]} + C^0_t L^3_x)$
on the slab $[t_0 - \delta, t_0 + \delta] \times \mathbf{R}^3$.

Remark 10.2. Note that the "b" index is now $s-$ instead of $3/4+$. This extra
regularity in the $b$ index will turn out to be helpful when proving (45). These types
of estimates morally follow from the local well-posedness theory (or more precisely,
the multilinear estimates underlying that theory) using tools such as [5, Lemma
12.1], but for various technical reasons it is not feasible to do so directly, and so we
have chosen instead the following more pedestrian argument.

Proof. All our spacetime norms here will be on the slab $[t_0 - \delta, t_0 + \delta]$. Many of the
unpleasant technicalities in the following argument will arise from the low frequency 
terms L^, and the reader is advised to ignore all the contributions from these terms 
in a first reading as they are not the essential difficulty 

By the continuity method and the smoothness of $\Phi$ it will suffice to prove this under
the a priori assumption

(80) $\nabla_{x,t} \Phi \in 2KB(I^{-1} H^{0,s-}_{[t_0-\delta,t_0+\delta]} + C^0_t L^3_x)$

(since this will imply that the space of $\delta$ for which (79) holds is both open and
closed, if $\delta$ is restricted to be sufficiently small).

To prove this, we begin by estimating $\Phi$ in various auxiliary norms. Split $\Phi$
smoothly into a low frequency component $\Phi_{low}$ supported on $|\xi| \le 2$, and a
local component $\Phi_{local}$ supported on $|\xi| \ge 1$. From (80) we have

$\nabla_x \Phi_{local} \in CKB(I^{-1} H^{0,s-}_{[t_0-\delta,t_0+\delta]} + C^0_t L^3_x),$

which implies that

$\Phi_{local} \in CKB(I^{-1} H^{1,s-}_{[t_0-\delta,t_0+\delta]} + C^0_t L^3_x).$

Also, from Sobolev embedding, (57) and the low frequency restriction we have

$\partial_t \Phi_{low} \in CKB(C^0_t L^3_x)$
while from (|77p and Sobolev embedding we have 

^ ow (to)eCB(L 6 1+ Ll) 
so by the fundamental theorem of calculus we have

$\Phi_{low} \in CKB(C^0_t L^6_x + C^0_t L^3_x).$
Combining these estimates we obtain

(81) $\Phi \in CKB(I^{-1} H^{1,s-}_{[t_0-\delta,t_0+\delta]} + C^0_t L^3_x + C^0_t L^6_x).$
From Strichartz (58) and (16) we have in particular that

$\Phi \in CKB(L^6_t L^6_x).$

Our next task is to obtain estimates on Aq. From (|5T|) . (j5"7| . (HIT) and Sobolev 
embedding we have 

while from ([80]). (|57]l . (|4T) we have 

From Lemma |2~TI (jT5J) , and (fTTj) we thus have 

t^eCA^B^ii- 1 ) 

so by (|2"Tj) we have 

(82) $A_0 \in CK^2 B(C^0_t \dot H^1_x).$
From Sobolev embedding we thus have

(83) $A_0 \in CK^2 B(C^0_t L^6_x) \subset CK^2 B(L^6_t L^6_x).$

Combining this with our $L^6_t L^6_x$ bound on $\Phi$ we thus have $\Phi \in CK^2 B(L^6_t L^6_x)$. In
particular we can control the cubic nonlinearity $N_3$:

(84) $N_3 \in CK^6 B(H^{0,0}_{[t_0-\delta,t_0+\delta]}).$

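To control the bilinear nonlinearities $N_0$, $N_1$, $N_2$ we need the following variant of
Proposition 8.1.

Lemma 10.3 (Modified bilinear estimates).

(85) $\|\eta(t) \phi \nabla_{x,t} \psi\|_{I^{-2} H^{0,0}} \lesssim N^{0+} \|\phi\|_{I^{-1} H^{1,s-}} \|\nabla_{x,t} \psi\|_{I^{-1} H^{0,s-}}$

(86) $\|\eta(t) (\partial_t A_0) \phi\|_{I^{-1} H^{0,s-1}} \lesssim N^{0+} \|\partial_t A_0\|_{H^{1/2+,0}} \|\phi\|_{I^{-1} H^{1,s-}}$

(87) $\|\eta(t) A_0 (\partial_t \phi)\|_{I^{-1} H^{0,s-1}} \lesssim N^{0+} \|\nabla_{x,t} A_0\|_{H^{1/2+,0}} \|\partial_t \phi\|_{I^{-1} H^{0,s-}}$

(88) $\|\eta(t) N_0(\phi, \psi)\|_{I^{-1} H^{0,s-1}} \lesssim N^{0+} \|\phi\|_{I^{-1} H^{1,s-}} \|\psi\|_{I^{-1} H^{1,s-}}$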

Proof. We first prove (88). By an appropriate Fourier decomposition it will suffice
to prove the estimate in the following four cases:

• (bounded-bounded interaction) $\phi$, $\psi$ are supported on the region $|\xi| \le N/2$.

• (high-high interaction) $\phi$, $\psi$ are supported on $|\xi| \ge N/10$.

• (bounded-high interaction) $\phi$ is supported on $|\xi| \le N/5$ and $\psi$ is supported
on $|\xi| \ge N/4$.

• (high-bounded interaction) $\psi$ is supported on $|\xi| \le N/5$ and $\phi$ is supported
on $|\xi| \ge N/4$.

For instance, one can decompose both $\phi$ and $\psi$ into three components with frequency support
$\ll N$, $\sim N$, $\gg N$ respectively, in such a way that each of the nine interactions falls into one of the
four categories described above.

By (64) (noting that $s- > 3/4+$) we have

$\|\eta(t) N_0(\phi, \psi)\|_{I^{-1} H^{0,s-1}} \lesssim N^{1-s} \|\eta(t) N_0(\phi, \psi)\|_{H^{s-1,s-1}} \lesssim N^{1-s} \|\phi\|_{H^{s,s-}} \|\psi\|_{H^{s,s-}}.$

If $\phi$ has high frequency then $\|\phi\|_{H^{s,s-}} \lesssim N^{s-1} \|\phi\|_{I^{-1} H^{1,s-}}$, otherwise we just have
$\|\phi\|_{H^{s,s-}} \lesssim \|\phi\|_{I^{-1} H^{1,s-}}$. Similarly for $\psi$. This allows us to deal with the
high-high, high-bounded, and bounded-high cases. In the bounded-bounded case, all
the $I$ operators are the identity, so it suffices to show

$\|\eta(t) N_0(\phi, \psi)\|_{H^{0,s-1}} \lesssim \|\phi\|_{H^{1+,s-}} \|\psi\|_{H^{1,s-}}$

where we have used the $N^{0+}$ to gain an epsilon regularity on the bounded
frequency function $\phi$. We crudely estimate the $H^{0,s-1}$ norm by the $L^2_t L^2_x$ norm and
crudely write the null form $N_0(\phi, \psi)$ as $\mathcal{O}(\phi \nabla_x \psi)$. The claim then follows from the
Strichartz embeddings $H^{1+,s-} \subset L^{2+}_t L^\infty_x$ and $H^{0,s-} \subset C^0_t L^2_x$ from (58).

Now we prove (86), (87). In the bounded-high and high-high cases this follows
from (62), (63) by the same arguments as before (indeed, we may even weaken the
$H^{1/2+,0}$ norm to $I^{-1} H^{1/2+,0}$). It remains to consider the bounded-bounded and
high-bounded cases. We begin with the bounded-bounded case. Estimating the
$H^{0,s-1}$ norm by the $L^2_t L^2_x$ norm, it suffices to show

$\|\eta(t) (\partial_t A_0) \phi\|_{L^2_t L^2_x} \lesssim \|\partial_t A_0\|_{H^{1/2+,0}} \|\phi\|_{H^{1,s-}}$

and

$\|\eta(t) A_0 (\partial_t \phi)\|_{L^2_t L^2_x} \lesssim \|\nabla_{x,t} A_0\|_{H^{1/2+,0}} \|\partial_t \phi\|_{H^{0,s-}}.$

For the first estimate we use the Sobolev embedding $H^{1/2+,0} \subset L^2_t L^3_x$ and the
Strichartz estimate $H^{1,s-} \subset C^0_t L^6_x$ from (58). For the second we use the Sobolev
embedding $\|A_0\|_{L^2_t L^\infty_x} \lesssim \|\nabla_{x,t} A_0\|_{H^{1/2+,0}}$ and the Strichartz estimate $H^{0,s-} \subset
C^0_t L^2_x$. The same argument also deals with the high-bounded case; one can use the
theory of paraproducts to cancel the factors of $I$.

Finally we prove (85). In the high-high case we compute using (61) with $s' = s'' = s$:

$\|\phi\nabla_{x,t}\psi\|_{I^{-2}H^{0,0}} \lesssim N^{2-2s}\|\phi\nabla_{x,t}\psi\|_{H^{2s-2,0}} \lesssim N^{2-2s}\|\phi\|_{H^{s,s-}}\|\nabla_{x,t}\psi\|_{H^{s-1,s-}}$

which is acceptable in the high-high case. In the high-bounded case we use (61)
with $s' = s$ and $s'' = 1$:

$\|\phi\nabla_{x,t}\psi\|_{I^{-2}H^{0,0}} \lesssim N^{1-s}\|\phi\nabla_{x,t}\psi\|_{H^{s-1,0}} \lesssim N^{1-s}\|\phi\|_{H^{s,s-}}\|\nabla_{x,t}\psi\|_{H^{0,s-}}$

which is acceptable. For the bounded-high case we similarly use (61) with $s' = 1$
and $s'' = s$. Now we turn to the bounded-bounded case. Here we take advantage
of the additional $N^{0+}$ factor; it suffices to prove

$\|\eta(t)\phi\nabla_{x,t}\psi\|_{H^{0,0}} \lesssim \|\phi\|_{H^{1+,s-}}\|\nabla_{x,t}\psi\|_{H^{0,s-}}.$

But this follows from the Strichartz embeddings $H^{1+,s-} \subset L^{2+}_tL^\infty_x$ and
$H^{0,s-} \subset C^0_tL^2_x$, the same embeddings used in the bounded-bounded case of (88).
We can now control $N_2$. Indeed we claim

(89)  $N_2 \in CN^{0+}K^2B(I^{-2}H^{0,0}_{[t_0-\delta,t_0+\delta]}).$

To prove this we multiply (80) and (81). To multiply the two $H^{s,b}$ spaces we
use (85). To multiply the $L^p_R$ spaces we use (17), (16) (indeed we can get into
$H^{0,0} = L^2_tL^2_x$ for these terms).



It remains to handle the cross terms when H s ' h is multiplied against a L R . By ([80 
and (|8Tj) it suffices to prove the embedding 



(90) (r'H - ) ■ C° t V R C I^H 00 
for L R = L^, L2. But this is implied by 

(91) (r^)-L|cr^ 



which easily follows from ([TBI and a decomposition in to high and bounded fre- 
quencies. 

From P ]) . (|gg [) . ([5tJ fl we have 

V x V,, t A g CiV +x 7 B(/- 2 ^ 5ito+5] ). 

Split ^4o into bounded frequencies |£| < 1 and local frequencies |£| > 1/2. For the 
local component we invert V in the above, while for bounded frequencies we use 
Sobolev embedding, to obtain 

(92) V x , t A G CN°+K 7 B(r 2 H^_ S to+s] + L 2 L?). 



Now we control No, Ni, N3. Indeed we claim 

(93) No, N 1; N 3 G CNO+K^Bil- 1 ^*:^ + L 2 t L 3 2 ). 

The cubic term N3 is acceptable by ([84[h so we turn to Ni. 

To control the A^dt^ component of Ni, we use (f5U]) . The 2KB(C?Ll) component 
of will be acceptable from ([83[) . as this places this contribution to A^dt^. in 
C t °L 2 , which easily embeds into I~ 2 H^ g _ s to+S y So it suffices to show that 

A ■ 2KB(I-'H^: 5M+5] ) c CN°+K 20 B(I-'H^:l to+s] ). 



Split Ao into low frequencies |£| < 2 and local frequencies |£| > 1. For local 
frequencies we can use flHU) and (|HB (observing that I- 2 H 10 C ffVa+.o^ For low 
frequencies we have 

^0 G Cif 2 B(C t °L^) 
from (f8"2"[) , and the claim will follow from ([9"U[) . 

Now we control the 9 f Ao$ component of Ni. To do this we multiply ([921) with f|81|) . 
The product of I -2 !! 1 ' and I _1 _ff 1 ' s_ is acceptable from ([86[l (again observing 

g ijl/2+,0) product of L 2 L 6 ^ £^3 j g ^ £2^2 g J-ljfO.a-l 

by ([T7]) . while the product of L 2 h\ and C°L^ is similarly in L 2 \\. For the cross 



The estimates there were phrased for cutoff functions r] centered at the origin, but it is clear 
from time translation invariance that one can also use cutoff functions centered at to . 
terms between ffVa+.o an d C?L P R we use (90]) (since H^ 2+ -° C I^H ' ), while 
for the cross terms between I -1 !! 1 '* - and L 2 we use the embedding J -1 !? 1 ' 8- C 
C^/- 1 ^!) followed by (jnij) . 

Finally, we control No = No($, $). We decompose $ smoothly into a low frequency 
component $j ow supported on |£| < 4 and a local frequency component $i oca i 
supported on |£| > 2. The contribution of N ($ local , $ local ) is acceptable from (|88|) 
and ([81]) . For the remaining terms we shall not exploit the null structure, and just 
write N ($, $) crudely as 0(^V X ±). 

Consider $ low V x ,t£ local . From (HO) we have V M £ local G CA-B^- 1 ^^^,), 
while from (81]) and ([16]) we have $j ow € CAB(C t L4). The claim then follows from 
PJ. 

Now consider !k\oc a \^ x,t!k\ow From (80]) and (16]) we have V Ijt $ low G CAB(C t L4), 
while from (81"]) we have $i oca i G CA'B(/ _1 iJj 1 t ' o s J (5 to+ ^)- The claim again follows 
from (Hill- 
Finally we consider $i ow V x .t$ low . As mentioned in the previous paragraphs we 
have $ low € CKB(C?L 6 4 ) and V^tJLiow G C*AB(C* t °L^). The claim then follows 
from (T7]) . 

This completes the proof of (|93p . From ([50]) we then have 

From (60]), (19]), (78]) and the linearity of □, we thus have 

V x>t $ € C^+^+A^BU- 1 ^;- to+s] +C%1). 

If we choose A sufficiently large, and S sufficiently small depending on A, N (but 
with 5 > C(K)N°-), then O follows, as desired. □ 

We apply the above Proposition with a fixed A sufficiently large, i.e. with A
an absolute constant, following our conventions for such constants. In particular,
henceforth all implicit constants are allowed to depend on A. As a corollary of the
above argument (specifically (79), (81), (92), (83)) we have the estimates

(94) |€CB(7- I ffJ; s _- ito+j] +^) 

(95) V X)t $ G CBC/- 1 ^" ito+5] + C?L§) 

(96) A GCB(/- 2 ^ o _ ato+(5] +C?L?) 

(97) ^Ao G CB(I-*H^ S to+s] + L 2 t Lf). 

In other words, ignoring the technical low frequency issues, $I\Phi$ lives in $H^{1,s-}$ and
$I^2A_0$ lives in $H^{2,0}$, and similarly for the time derivatives (but with one lower order
of regularity, of course).

In the remainder of the paper we use the estimates (94)-(97) to obtain (45).



11. Differentiating the Hamiltonian 



Having obtained control on $A_0$, $\Phi$ on the interval $[t_0-\delta, t_0+\delta]$, we are now
ready to begin the proof of (45). We shall use the real inner product $\langle u,v\rangle :=
\mathrm{Re}\int_{\mathbb{R}^3} u(x)\overline{v(x)}\,dx$ throughout this section. Since $m$ is real and symmetric we
observe that $I$ is self-adjoint: $\langle Iu,v\rangle = \langle u,Iv\rangle$. Similarly for powers of $I$.
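The self-adjointness of $I$ can be checked numerically on a periodic grid. The following is a minimal sketch, assuming a discrete stand-in for the multiplier $m$ (equal to 1 for frequencies up to a cutoff and decaying like $(|k|/N)^{s-1}$ beyond it); the grid size, the cutoff and the helper names are illustrative choices and are not taken from the paper.

```python
import cmath
import random

def dft(x):
    """Naive discrete Fourier transform of a list of complex numbers."""
    n = len(x)
    return [sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]

def idft(X):
    """Inverse of dft."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * j * k / n) for k in range(n)) / n
            for j in range(n)]

def multiplier(k, n, N=3, s=0.9):
    """Hypothetical discrete analogue of m: 1 for |k| <= N, (|k|/N)^(s-1) beyond."""
    freq = min(k, n - k)  # |k| on the periodic frequency grid, so m(k) = m(-k)
    return 1.0 if freq <= N else (freq / N) ** (s - 1)

def apply_I(u):
    """Apply the Fourier multiplier operator with the real symbol above."""
    n = len(u)
    U = dft(u)
    return idft([multiplier(k, n) * U[k] for k in range(n)])

def real_inner(u, v):
    """Discrete version of <u, v> = Re sum u * conj(v)."""
    return sum((a * b.conjugate()).real for a, b in zip(u, v))

random.seed(0)
n = 16
u = [complex(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(n)]
v = [complex(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(n)]
lhs = real_inner(apply_I(u), v)
rhs = real_inner(u, apply_I(v))
print(abs(lhs - rhs))  # difference between <Iu, v> and <u, Iv>
```

The two pairings agree up to rounding because the symbol is real, which is all that the argument in this section uses.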

Fix $T \in [t_0-\delta/2, t_0+\delta/2]$. By the fundamental theorem of calculus it suffices to
show that

(98)  $\left|\int_{t_0}^{T} \frac{d}{dt}H[I\Phi[t]]\,dt\right| \lesssim \frac{1}{N^{(s-1/2)-}}.$



Our next task is to expand the expression

(99)  $\frac{d}{dt}H[I\Phi[t]].$

If the $I$ were not present then (99) would vanish. With the $I$ present, (99) does not
vanish completely, but we will be able to express (99) in terms of commutators of
$I$ and other operators.

Before we do so, let us begin with a heuristic discussion, ignoring the elliptic term
$A_0$ and the null structure. Since (MKG-CG) is roughly of the form

(100)  $\Box\Phi = O(\Phi\nabla_{x,t}\Phi) + O(\Phi\Phi\Phi)$

and the Hamiltonian (13) is roughly of the form

$H[\Phi] = \int O(\nabla_{t,x}\Phi\nabla_{t,x}\Phi) + O(\Phi\Phi\nabla_{x,t}\Phi) + O(\Phi\Phi\Phi\Phi)$

it seems reasonable to expect an identity roughly of the form

(101)  $\partial_tH[\Phi] = \langle\Phi_t, \Box\Phi - O(\Phi\nabla_{t,x}\Phi) - O(\Phi\Phi\Phi)\rangle$

for arbitrary $\Phi$ (not necessarily solving (MKG-CG)), since we know in advance that
the Hamiltonian must be preserved by the flow (MKG-CG). In particular we expect

$\partial_tH[I\Phi] = \langle I\Phi_t, \Box I\Phi - O(I\Phi\nabla_{t,x}I\Phi) - O(I\Phi I\Phi I\Phi)\rangle.$

On the other hand, by applying $I$ to (100), we have

$\Box I\Phi - IO(\Phi\nabla_{x,t}\Phi) - IO(\Phi\Phi\Phi) = 0.$

Inserting this into the previous equation, we expect to split $\partial_tH[I\Phi]$ as two commutators:

(102)  $\partial_tH[I\Phi] = \langle I\Phi_t, O(I(\Phi\nabla_{t,x}\Phi) - I\Phi\nabla_{t,x}I\Phi)\rangle + \langle I\Phi_t, O(I(\Phi\Phi\Phi) - I\Phi I\Phi I\Phi)\rangle.$



We now begin the rigorous argument. The rigorous form of (101) is

Lemma 11.1 (First variation of Hamiltonian). If $\Phi$ is arbitrary (not necessarily
solving (MKG-CG)), then

^H[$[t]] - -(F fl0> d a F^ + lm(0i^)) - (D cf), D a D a (j)), 
where $D_\alpha$ and $F_{\alpha\beta}$ were defined in (1), (2).

Observe that this quantity vanishes (as expected) if $\Phi$ solves (MKG), and in
particular if it solves (MKG-CG).

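Proof. We recall the stress-energy tensor

$T_{\alpha\beta} := F_\alpha{}^{\mu}F_{\beta\mu} - \frac{1}{4}\eta_{\alpha\beta}F_{\mu\nu}F^{\mu\nu} + \mathrm{Re}(D_\alpha\phi\,\overline{D_\beta\phi}) - \frac{1}{2}\eta_{\alpha\beta}\mathrm{Re}(D_\mu\phi\,\overline{D^\mu\phi}),$

where $\eta_{\alpha\beta}$ is the Minkowski metric. From (13) we see that

$H[\Phi[t]] = \int T_{00}(x,t)\,dx.$

Thus

$\frac{d}{dt}H[\Phi[t]] = -\int \partial^\alpha T_{\alpha 0}(x,t)\,dx.$

To compute the integrand we observe that

$\partial_\alpha\mathrm{Re}(u\overline{v}) = \mathrm{Re}(D_\alpha u\,\overline{v}) + \mathrm{Re}(u\,\overline{D_\alpha v}).$

Using this we can expand $\partial^\alpha T_{\alpha 0}$ as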

(d tt F£)iV+^3 Q F 0p -i(d F^)F^+^ 

Collecting terms and relabeling (using the anti-symmetry of F), we can rewrite the 
above as 

-iF^(aoF^+a M F, o +^F OM )+(a tt F Q ^)F OM +Rcp"Z? Q ^^)+Re(C Q 0[^,C o ]0). 

The first term vanishes from the Bianchi identity $dF = ddA = 0$. The last term can
be simplified as $[D_\alpha, D_0] = iF_{\alpha 0}$. After a little more collecting terms and relabeling,
we obtain

d a T a o = (d a F a x + lm(0^)F Qll + Re{D a D a 0^) 
and the Lemma follows. □ 

In the Coulomb gauge (7), we can use this lemma to rewrite (99) as

4h[/$[*]] = -(djlAo-dtlAj, aiA ] +d 3 d t IA o +lm{I0 J I^))-(D Q I^D 3 D,I ( j ) ^D Q DoI^ 

where D a :— d a + i(IA a ). On the other hand, by applying / to (MKG) we have 
ulAj + djdtlAo + llm(0^>) = 

and 

I{D Q D a 4>-D J D J cp)=Q. 
Thus one can write (|99p as a linear combination of the commutator expressions 

(103) (djIAo - d t IA j ,Hm(0^) ~ Im (10 j I(f>)), 
(104) (DoIfalDoDo^-DoDoIt), 
and 

(105) (DoI<t>, IDjD j( j) - DjDjI(f>). 
This should be compared with (102).

We now break up (103), (104), (105) further. We introduce the nonlinear commutators

$[I,N_0] := IN_0(\Phi,\Phi) - N_0(I\Phi,I\Phi)$
$[I,N_1] := (I(\partial_tA_0\,\Phi) - \partial_tIA_0\,I\Phi,\ I(A_0\,\partial_t\Phi) - IA_0\,I\partial_t\Phi)$
$[I,N_2] := I(\Phi\nabla_{x,t}\Phi) - I\Phi\,\nabla_{x,t}I\Phi$
$[I,N_3] := I(\Phi^3) - (I\Phi)^3$

We also define the "mollified time derivative"

$D_0\Phi := (\partial_tIA, D_0I\phi).$



In later sections we shall prove the estimates

(106)  $\left|\int_{t_0}^{T} \langle D_0\Phi, O([I,N_k])\rangle\,dt\right| \lesssim \frac{1}{N^{(s-1/2)-}}$

for $k = 0, 1, 3$, as well as the variant

(107)  $\left|\int_{t_0}^{T} \langle\nabla_xIA_0, O([I,N_k])\rangle\,dt\right| \lesssim \frac{1}{N^{(s-1/2)-}}$

for $k = 2, 3$. (In fact, with the exception of the null form estimate (106) with $k = 0$,
we will be able to obtain a decay of $O(N^{-1/2+})$.) For now we show how these
estimates imply that (103), (104), (105) are bounded by $O(1/N^{(s-1/2)-})$ as required.

We begin with (103). Consider the contribution of $\partial_jIA_0$. We write this term
crudely as

$\langle\nabla_xIA_0, O([I,N_2]) + O([I,N_3])\rangle$

which is acceptable by (107).

Now consider the contribution of $\partial_tIA_j$. By (7) we may freely insert a projection
$P$ on the left term of the inner product, and hence on the right by self-adjointness.
From the definition of the null form $N_0$ we can thus write this contribution as

$\langle\partial_tIA, O([I,N_0]) + O([I,N_3])\rangle$

which is acceptable by (106).

Now we expand out (104). Observe that $ID_0D_0\phi - D_0D_0I\phi$ can be expanded as a
linear combination of $[I,N_1]$ and $[I,N_3]$, so the claim follows from (106).

Now we expand out (105). From (7) we have

$ID_jD_j\phi = I\Delta\phi + 2iI(PA\cdot\nabla_x\phi) - I(|A|^2\phi)$

and

$D_jD_jI\phi = I\Delta\phi + 2i(PIA)\cdot(\nabla_xI\phi) - |IA|^2I\phi$

so by the definition of the null form $N_0$ we have

$ID_jD_j\phi - D_jD_jI\phi = O([I,N_0]) + O([I,N_3])$

and the claim follows from (106).

It remains to prove (106), (107). We shall do so in later sections, but for now we
give some estimates on $D_0\Phi$.

Lemma 11.2 ($D_0\Phi$ estimate). We have

(108) D $ G CB(H^: s/2 to+s/2] + C?£? ). 

Proof. The bound on $\partial_tIA$ follows from (95) and (16). Now we bound $D_0I\phi$. We
split $D_0I\phi = (D_0I\phi)_{\mathrm{low}} + (D_0I\phi)_{\mathrm{local}}$, where the low term has Fourier support in
$|\xi| < 10$ and the local term has Fourier support in $|\xi| > 9$.

The low term is acceptable from (75) and Sobolev, so we consider the local term.
We split $D_0I\phi = I\phi_t + iIA_0I\phi$. The $I\phi_t$ component is acceptable from (95), so it
remains to control the local frequency component $(IA_0I\phi)_{\mathrm{local}}$ of $IA_0I\phi$. Because
this is a lower order term, regularity will not be a major problem, but there will be
some other technical issues related to the time truncation.

Let $\eta(t)$ be a bump function adapted to $[t_0-\delta, t_0+\delta]$ which equals 1 on
$[t_0-\delta/2, t_0+\delta/2]$. It will suffice to show that

$\|\eta(t)(IA_0I\phi)_{\mathrm{local}}\|_{H^{0,s-}} \lesssim 1.$

From the crude estimate $\langle|\xi|-|\tau|\rangle \lesssim |\xi| + |\tau|$ when $|\xi| > 9$, we have

$\|\eta(t)(IA_0I\phi)_{\mathrm{local}}\|_{H^{0,s-}} \lesssim \|\nabla_{x,t}(\eta(t)IA_0I\phi)\|_{L^2_tL^2_x}.$

By the Leibnitz rule, it thus suffices to show the quantities

(109)  $\|\eta'(t)IA_0I\phi\|_{L^2_tL^2_x},\ \|\eta(t)(\nabla_{x,t}IA_0)I\phi\|_{L^2_tL^2_x},\ \|\eta(t)IA_0(\nabla_{x,t}I\phi)\|_{L^2_tL^2_x}$

are bounded.

From (|H7| and Sobolev we have the crude bound 

Ao,V Xlt A eCB(L 2 t L 6 x + L 2 t L 3 x ) 
while from (|57|) and Sobolev we have 

H G CB{LfL%) n CB{LfLl). 

The first two norms of (| 109[) are then bounded by Holder (since I is bounded on 
all the above spaces). For the last norm we instead use (jTHJ) , and Sobolev to 
obtain 

Ao G CB(L 2 t L™) n CB{L\L%) 
while from (|95|) , (|57)l and Sobolev we have 

\7 x ^eCB(C°Ll + C?L 3 x ). 
The claim again follows from Holder. □ 
The only remaining task is to establish the estimates (106), (107) for various values
of $k$.

12. The cubic commutator $[I, N_3]$

We first prove the estimates (106), (107) for the cubic commutator $[I, N_3]$, which
is the easiest to handle as there are no derivatives. Indeed we will not need the full
strength of the commutator structure here, and we can use very crude Lebesgue
space estimates. In this case we will obtain a decay of $N^{-1/2+}$ instead of just
$1/N^{(s-1/2)-}$.

Smoothly divide $\Phi := \Phi_{\mathrm{bounded}} + \Phi_{\mathrm{high}}$, where $\Phi_{\mathrm{high}}$ has frequency support on
$|\xi| > N/10$ and $\Phi_{\mathrm{bounded}}$ is supported on $|\xi| < N/5$.

We need the following Strichartz-type estimates.

Lemma 12.1 (Strichartz estimates). On the slab [to,T] x R 3 , we have 

(110) D <I>, VJA e CB(C?L X + C° t L 2 x + L 2 L 9 J 2 ) 

(111) $eCB(L^nL^nft) 

(112) $ high € CiV-V2+ B(C L 2 ) n B (C7 t °L 3 ) n CN-^+BiLjLt) 

Proof. The bound on $D_0\Phi$ comes from Lemma 11.2 and the crude estimate $H^{0,s-} \subset
C^0_tL^2_x$. The bound on $\nabla_xIA_0$ comes from (96) and the crude embedding $I^{-1}H^{1,0} \subset
L^2_tL^{9/2}_x$ arising from Sobolev embedding (since $s > 5/6$).

For $, the bounds in (|111[) come from (|94|) , (|16|) , and the Strichartz estimates 
j-i H i,s- g ff .,i/a+ c L *Ll,LiLl,L* /3 Ll 

from (|58| . 

For Ao, we argue differently. The low frequency component is acceptable from 
(f96|) . (fT6| . For the medium and high frequency components, the estimate (|96|) 
and Sobolev gives € CB(L(L^°), while ([57)) and the fundamental theorem of 
calculus and Sobolev embedding gives A G CB(C t °L 2 n C t °L 3 ). The claim then 
follows by interpolation. 

For $, the bounds in (|112[) come from (|94|). the observation that 
and the Strichartz embeddings (from (|58p) 

For Ao, we have from (|96|) and Sobolev that 

II^MghlkjZ- < ||Ao, hig h|| ff 3/2 + ,o < iV-^+Po.highHj-^.o < 7V-l/2+ 5 

1 [t -Mo+< 5 ] [*o-Mo+ 5 l 
while from (97) and the fundamental theorem of calculus we have

||A),high||cPL 2 ~ II^O,high(^o)lli? + || <9 t A),high | H °.° 

* [*o-<Mo+< s ] 

< AT-V2(|| V:cj4o ( to )|| + U^ j^ll M ) 

[t ~8,t Q + 5] 

< N~ 1/2 . 

The claims then follow from interpolation. □ 

We now prove (106), (107). From (110) and Hölder it will suffice to prove that

[/, N 3 ] G CB(LjL 6 J 5 n n 
We expand out $[I, N_3]$ as the sum of eight terms

$[I,N_3] = \sum_{a,b,c\in\{\mathrm{bounded},\mathrm{high}\}} I(\Phi_a\Phi_b\Phi_c) - (I\Phi_a)(I\Phi_b)(I\Phi_c).$

When $a = b = c = $ bounded then $I$ acts like the identity everywhere, and the
summand vanishes. Thus it will suffice to show that

/(<i> Q <i> b <i> c ), (J# )(J# 6 )(/3c) G CBflJif) n CB(L^) n cb(l?4/ 7 ) 

whenever at least one of a, b, c is equal to high. By symmetry we may assume 
c = high. But then the claim follows from (|112l) and Holder. (The operator 

I and the projections $ i-> ^bounded) l— ^ ^high are bounded on every translation- 
invariant Banach space. To get L\l^J' 1 , one places two factors in LfL^., and the 
last factor in an interpolant of C®l? x and C®L\ .) This completes the proof of (|106[) . 
(|f 071) for the cubic commutator [I, N3]. 

Remark 12.2. It is in fact possible to use more Strichartz estimates to improve the
estimate for $[I, N_3]$ even further to $N^{-1+}$; this would be consistent with the results
for the cubic nonlinear wave equation in [17]. The numerology is as follows. As we
saw above, it suffices to put the three factors $\Phi$ in $N_3 = \Phi^3$ in $L^3_tL^6_x$. The Strichartz
embedding (58) allows this if $\Phi$ is in $H^{2/3,1/2+}$. But $\Phi$ is in $H^{1,s-}$ (for medium
frequencies at least), so there is 1/3 of a derivative to spare. Since there are three
factors of $\Phi$, we thus see that there is about a full derivative of surplus regularity
in $[I, N_3]$. From (115) one then expects to extract a gain of $N^{-1+}$, in principle
at least.
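The derivative count in Remark 12.2 can be reproduced with exact arithmetic. The sketch below assumes the standard scaling relation $1/q + 3/r = 3/2 - s$ and the wave admissibility condition $1/q + 1/r \le 1/2$ for Strichartz estimates in three space dimensions, and it assumes that the mixed norm in question is $L^3_tL^6_x$; the function names are illustrative and do not come from the paper.

```python
from fractions import Fraction as F

def strichartz_regularity(q, r, d=3):
    """Sobolev exponent s forced by scaling for a wave Strichartz estimate
    H^s -> L^q_t L^r_x in d space dimensions: 1/q + d/r = d/2 - s."""
    return F(d, 2) - F(1, q) - F(d, r)

def wave_admissible(q, r, d=3):
    """Wave admissibility condition 1/q + (d-1)/(2r) <= (d-1)/4."""
    return F(1, q) + F(d - 1, 2 * r) <= F(d - 1, 4)

needed = strichartz_regularity(3, 6)   # regularity needed per factor
spare_per_factor = 1 - needed          # Phi has one derivative available
total_spare = 3 * spare_per_factor     # three factors in N_3 = Phi^3
print(needed, spare_per_factor, total_spare)
```

With these assumptions each factor needs $2/3$ of a derivative, leaving $1/3$ per factor and one full derivative in total, which matches the count in the remark.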



13. Frequency interactions of bilinear commutators 

In the remainder of the paper we will prove the estimates (106) or (107) for the
bilinear commutators $[I, N_0]$, $[I, N_1]$, $[I, N_2]$. Ignoring derivatives and null forms



° Admittedly, in the above argument only one of the three factors <I> could be assumed to be 
high frequency, however one should still be able to obtain the full gain of A r_1 + by playing around 
with the Strichartz exponents (e.g. putting the high frequency factor in L\ Lj 0- and the other two 
in C®L^~), or perhaps by using commutator estimates as we do with the bilinear commutators 
below. 
(which will have no bearing on the discussion in this section), all the expressions
on the left-hand side have the form

(113)  $\int_{t_0}^{T} \langle u(t), I(v(t)w(t)) - I(v(t))I(w(t))\rangle\,dt.$

On the other hand, the functions $u$, $v$, $w$ which appear here have different behavior
at low, medium and high frequencies. The purpose of this section is to decompose
the above trilinear expressions in terms of these three frequency components.

We smoothly split $u = u_{\mathrm{low}} + u_{\mathrm{med}} + u_{\mathrm{high}}$, where $u_{\mathrm{low}}$ is supported on $|\xi| < 20$, $u_{\mathrm{med}}$
is supported on $10 < |\xi| < N/5$, and $u_{\mathrm{high}}$ is supported on $|\xi| > N/10$. Similarly
decompose $v$ and $w$. We can then split (113) into 27 terms of the form

(114)  $\int_{t_0}^{T} \langle u_a(t), I(v_b(t)w_c(t)) - I(v_b(t))I(w_c(t))\rangle\,dt$

where $a, b, c \in \{\mathrm{low}, \mathrm{med}, \mathrm{high}\}$.

This may look like a lot of terms, but fortunately most of these terms are zero. For
instance, if neither of $b$ or $c$ is high, then $I$ acts like the identity and (114) vanishes.
So we may assume at least one of $b$, $c$ is high.

Next, we claim that if one of $a$, $b$, $c$ is low frequency, then (114) vanishes unless the
other two indices are high frequency. To see this, suppose (for instance) that $a$ was
low frequency and $b$ was low or med frequency. Then we can integrate by parts and
rewrite the above as

$\int_{t_0}^{T} \langle I(u_a(t)), v_b(t)w_c(t)\rangle - \langle I(u_a(t)\overline{Iv_b(t)}), w_c(t)\rangle\,dt$

at which point all the $I$s act like the identity and so (114) vanishes. Similarly for
other permutations.

From this discussion we see that of the 27 terms in the decomposition, only 9 are
non-zero, and they are listed in Figure 1.
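The count of surviving terms can be confirmed by brute-force enumeration. The following is a minimal sketch that encodes only the two vanishing rules stated above (at least one of $b$, $c$ must be high; a low index forces the other two to be high); the function name is illustrative.

```python
from itertools import product

BANDS = ("low", "med", "high")

def may_be_nonzero(a, b, c):
    """Apply the two vanishing rules from the text to the term (114)."""
    # Rule 1: if neither b nor c is high, I acts as the identity and the term vanishes.
    if "high" not in (b, c):
        return False
    # Rule 2: if some index is low, the other two must both be high.
    triple = (a, b, c)
    for i, band in enumerate(triple):
        if band == "low":
            others = triple[:i] + triple[i + 1:]
            if others != ("high", "high"):
                return False
    return True

surviving = [t for t in product(BANDS, repeat=3) if may_be_nonzero(*t)]
for t in surviving:
    print(t)
print(len(surviving))
```

Merging the cases $a$ = med and $a$ = high then groups the surviving triples into six rows.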

We now discuss qualitatively how each of the six cases in Figure 1 will be estimated.
In all of the cases, the main challenge in proving (106) or (107) is to obtain the decay
factor $1/N^{(s-1/2)-}$; it is relatively straightforward to prove these estimates without
this decay factor, but then Proposition 5.4 will only let us control the Hamiltonian
for times $T = O(1)$, which will not give us global well-posedness for any $H^s_x$.



To obtain this decay we must use the fact that the high frequencies are small if
measured in rough norms. In particular, we will make frequent use of the simple
estimate

(115)  $\|u\|_{H^{s-\theta}} \lesssim N^{-\theta}\|u\|_{H^s}$

valid for all $u$ such that $\hat u$ is only supported in "high" frequencies $|\xi| \gtrsim N$, and all
$\theta > 0$ and $s \in \mathbb{R}$. Thus if there is a high frequency term present, we can sacrifice
some of its regularity to obtain the desired gain in $N$.
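The estimate (115) is a pointwise comparison of Fourier weights, and can be illustrated on a finite set of frequencies. The sketch below assumes the usual weight $\langle\xi\rangle = (1+|\xi|^2)^{1/2}$ and uses a hypothetical test function whose coefficients are random and supported on $|\xi| \ge N$; the particular values of $N$, $s$ and $\theta$ are arbitrary.

```python
import math
import random

def japanese_bracket(xi):
    """<xi> = (1 + |xi|^2)^(1/2)."""
    return math.sqrt(1.0 + xi * xi)

def sobolev_norm(coeffs, s):
    """H^s norm of a function given by {frequency magnitude: Fourier coefficient}."""
    return math.sqrt(sum(japanese_bracket(xi) ** (2 * s) * abs(c) ** 2
                         for xi, c in coeffs.items()))

random.seed(1)
N = 50.0
s, theta = 0.9, 0.4
# Hypothetical test function: random coefficients on frequencies |xi| >= N only.
u_hat = {N + 10.0 * j: random.gauss(0, 1) for j in range(200)}

lhs = sobolev_norm(u_hat, s - theta)
rhs = N ** (-theta) * sobolev_norm(u_hat, s)
print(lhs, rhs, lhs <= rhs)
```

Each summand on the left is at most $N^{-2\theta}$ times the corresponding summand on the right because $\langle\xi\rangle \ge N$ on the support, so in this model the implicit constant is 1.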
a          b      c      Useful lemmas and estimates
low        high   high   Lemma 13.1
high       low    high   Lemma 13.1, Lemma 13.2
high       high   low    Lemma 13.1, Lemma 13.2
med/high   med    high   Lemma 13.3
med/high   high   high   (120), Lemma 13.3
med/high   high   med    (120), Lemma 13.3

Figure 1. List of possible frequency interactions for (113), and
the Lemmas and estimates which are useful in each case (although
for the null form $N_0$ the analysis is more complicated than the
above table suggests). In most cases, it will not be so important
to distinguish between the cases $a$ = med and $a$ = high. One can
also eliminate the med-med-high case by Fourier support considerations,
though this does not significantly simplify the argument.

This will be fairly straightforward in the first three cases of Figure 1 when there
are two high frequency terms, because the low frequency term is smooth and easily
estimated. In fact for these low frequency cases one can usually improve the decay
estimate to $N^{-1/2+}$ or better. We use the following two lemmas to handle the low
frequency cases.

Lemma 13.1. We have

$|(113)| \lesssim \|u\|_{L^{q_1}_tH^{s_1}_x}\|v\|_{L^{q_2}_tL^p_R}\|w\|_{L^{q_3}_tH^{s_3}_x}$

for any $u \in L^{q_1}_tH^{s_1}_x$, $v \in L^{q_2}_tL^p_R$, $w \in L^{q_3}_tH^{s_3}_x$ with $1 \le p, q_1, q_2, q_3 \le \infty$, $1/q_1 +
1/q_2 + 1/q_3 \le 1$, $s_1 + s_3 \ge 0$, and $R = O(1)$. Similarly for permutations of $u$, $v$, $w$.

Proof. By (16) we may take $p = \infty$; by lowering $s_3$ if necessary we may assume
$s_3 = -s_1$. By a Hölder in time (and discarding all appearances of the operator
$I$, which is bounded on every Lebesgue and Sobolev space) it suffices to prove the
spatial estimate

$|\langle u, vw\rangle| \lesssim \|u\|_{H^{s_1}}\|v\|_{L^\infty}\|w\|_{H^{-s_1}}.$

We perform a Littlewood-Paley decomposition $u = \sum_{k\ge 0} u_k$ where $u_k$ is supported
in the region $\langle\xi\rangle \sim 2^k$. Similarly split $w = \sum_{k'\ge 0} w_{k'}$. Observe from the Fourier
support of $v$ that $\langle u_k, vw_{k'}\rangle$ vanishes unless $k' = k + O(1)$. By Hölder we may
therefore estimate the left-hand side by

$\sum_{k,k'\ge 0:\ k'=k+O(1)} \|u_k\|_{L^2}\|v\|_{L^\infty}\|w_{k'}\|_{L^2}$

which is comparable to

$\sum_{k,k'\ge 0:\ k'=k+O(1)} \|u_k\|_{H^{s_1}}\|v\|_{L^\infty}\|w_{k'}\|_{H^{-s_1}}.$

The claim then follows from Cauchy-Schwarz and the almost orthogonality of the
$u_k$ and of the $w_{k'}$ in Sobolev norms.

The claim for permutations follows since the expression $\langle u(t), v(t)w(t)\rangle$ is essentially
invariant under permutations (the conjugation being irrelevant for the above
norms). □

This lemma does not give the decay of $1/N^{(s-1/2)-}$ directly, but we shall combine it
with (115) to do so if there is enough surplus regularity in the $u$ and $w$ variables.
However, even when this surplus regularity is unavailable we can still obtain this
decay if there is a commutator structure. More precisely:

Lemma 13.2 (Commutator estimate). We have 

for any $u \in L^{q_1}_tL^2_x$, $v \in L^{q_2}_tL^p_R$, $w \in L^{q_3}_tL^2_x$ with $1 \le p, q_1, q_2, q_3 \le \infty$, $1/q_1 + 1/q_2 +
1/q_3 \le 1$, and $R = O(1)$.

Proof. By (16) it suffices to take $p = \infty$. By Hölder's inequality in time, it thus
suffices to show the spatial commutator estimate

\(u,I(vw) - vlw)\ < w^NUsIMIl-IIIHIz* 

for all $u \in L^2_x$, $v \in L^\infty_R$, $w \in L^2_x$. (Observe that $Iv = v$ for $v \in L^\infty_R$.)

Let us first assume that $w$ is supported in the annulus $|\xi| \sim M$ for some dyadic $M$;
we will sum in $M$ later. If $M \ll N$ then $I(vw) - v(Iw) = vw - vw = 0$, so we may
assume $M \gtrsim N$. Under this assumption, we will prove that
(116) 

\(u,l(vw)-vlw)\ <M-V(M)|M| L 2|H| L ~|H| L 2 < M^IMI^IMIj-IHI^; 

the claim then follows for general $w$ by a Littlewood-Paley decomposition and the
triangle inequality.

It remains to show (116). Fix $M$. We use Plancherel to write

$(I(vw) - vIw)^\wedge(\xi) = \int_{\xi=\xi_1+\xi_2} (m(\xi_1+\xi_2) - m(\xi_2))\hat v(\xi_1)\hat w(\xi_2).$

From the support conditions on $v$ and $w$ we may insert some cutoff functions

$(I(vw) - vIw)^\wedge(\xi) = \int_{\xi=\xi_1+\xi_2} (m(\xi_1+\xi_2) - m(\xi_2))a(\xi_1)b(\xi_2)\hat v(\xi_1)\hat w(\xi_2)$

where $a$ is a bump function adapted to $|\xi| \lesssim 1$ and $b(\xi_2)$ is a bump function adapted
to $|\xi| \sim M$.

From the mean value theorem and the smoothness of $m$ we see that

$m(\xi_1+\xi_2) - m(\xi_2) = O(M^{-1}m(M))$

on the support of $a(\xi_1)b(\xi_2)$. Moreover, we may write

$(m(\xi_1+\xi_2) - m(\xi_2))a(\xi_1)b(\xi_2) = M^{-1}m(M)c(\xi_1,\xi_2)$

where $c$ is a bump function of two variables adapted to the region $|\xi_1| \lesssim 1$, $|\xi_2| \sim M$.

By inverting the Fourier transform again, we obtain

$(I(vw) - vIw)(x) = M^{-1}m(M)\int\int \check c(y,z)v(x+y)w(x+z)\,dy\,dz$

where $\check c$ is the inverse Fourier transform of $c$. From the bump function estimates
on $c$ and standard integration by parts computations, we obtain the bounds

$|\check c(y,z)| \lesssim M^3\langle y\rangle^{-100}\langle Mz\rangle^{-100}.$

Thus by Minkowski's inequality followed by Hölder's inequality,

(117)  $\|I(vw) - vIw\|_{L^2} \lesssim M^{-1}m(M)\int_{\mathbb{R}^3}\int_{\mathbb{R}^3} M^3\langle y\rangle^{-100}\langle Mz\rangle^{-100}\|v(\cdot+y)\|_{L^\infty}\|w(\cdot+z)\|_{L^2}\,dy\,dz$

and the claim (116) then follows from the Cauchy-Schwarz inequality. □

Because of this Lemma, the low frequency terms will be quite minor in comparison 
to the medium and high frequency interactions, although they will unfortunately 
occupy about half of the cases in the sequel. For the medium and high frequency 
interactions we shall often use the estimate 

Lemma 13.3 (Trilinear estimate). Let $q_1$, $q_2$, $q_3$ be such that $1/q_1 + 1/q_2 +
1/q_3 \le 1$. Then

(i) We have

$|(113)| \lesssim \|u\|_{L^{q_1}_tH^{s_1}_x}\|v\|_{L^{q_2}_tH^{s_2}_x}\|w\|_{L^{q_3}_tH^{s_3}_x}$

whenever $0 \le s_1 + s_2,\ s_2 + s_3,\ s_3 + s_1$ and $s_1 + s_2 + s_3 > 3/2$.

(ii) Under the same assumptions, we also have the variant

$|(113)| \lesssim \|u\|_{L^{q_1}_tH^{s_1}_x}\|I^{-1}v\|_{L^{q_2}_tH^{s_2}_x}\|Iw\|_{L^{q_3}_tH^{s_3}_x}.$

(iii) Let s\,S2, S3, s' 1; s' 2 , s' 3 be such that s\ + S2 > 0, —S3 < min(si,S2), 
—S3 < S1+S2 — 3/2, s'i+s'2 > 0, —s 3 < min(s' 1 ,s' 2 ) and—s' 3 < s^+s^ — 3/2. 
Then 

(118) (El < \\Iu\\ H °i \\v\\ H s 2 \\w\\ H > 3 + \\u\\ , || J«|| , \\Iw\\ 4 

Proof. The first inequality is an immediate consequence of Hölder in time, (14) and
duality.

To prove the second estimate, we use a Hölder in time and the triangle inequality
to reduce to proving the spatial estimates

$|\langle u, (Iv)(Iw)\rangle|,\ |\langle u, I(vw)\rangle| \lesssim \|u\|_{H^{s_1}}\|I^{-1}v\|_{H^{s_2}}\|Iw\|_{H^{s_3}}.$

The former estimate again follows from (14) and duality, noting that $I^{-1}v$ controls
$Iv$ in the $H^{s_2}$ norm. To handle the latter estimate, we rewrite

$\langle u, I(vw)\rangle = \langle Iu, vw\rangle = \langle(Iu)\overline{v}, w\rangle = \langle I^{-1}((Iu)\overline{v}), Iw\rangle,$

and reduce to showing the bilinear estimate

$\|I^{-1}((Iu)\overline{v})\|_{H^{-s_3}} \lesssim \|u\|_{H^{s_1}}\|I^{-1}v\|_{H^{s_2}}.$

We may assume that $u$, $v$ have non-negative Fourier transform. From the pointwise
inequality $m(\xi+\eta)^{-1} \lesssim m(\xi)^{-1} + m(\eta)^{-1}$ and Plancherel, we see that $I^{-1}$ obeys
a fractional Leibnitz rule, so it suffices to show that

$\|u\overline{v}\|_{H^{-s_3}} \lesssim \|u\|_{H^{s_1}}\|I^{-1}v\|_{H^{s_2}}$

and

$\|(Iu)(I^{-1}\overline{v})\|_{H^{-s_3}} \lesssim \|u\|_{H^{s_1}}\|I^{-1}v\|_{H^{s_2}}.$

But these both follow from (14) (observing that $u$ controls $Iu$ and $I^{-1}v$ controls $v$;
the conjugation is irrelevant).

To prove the third estimate it is enough to prove 



\(u,I(vw))\ < \\Iu\\ H >i \\v\\h°2 \\w\\h>3 

and 



(119) |<t*,JwJ«;>| < |M| H4 ||/ W || ff4 ||Hl ff 4 

But, by integration by parts, duality and (|14[) we have 

\{u,I(vw))\ =\(Iu,vw)\ 

< \\Iu\\h'i II^IIh-^i 

< ||Ju||iyn \\v\\h s 2 \\w\\h' 3 

and 



\{u,(Iv)(Iw))\ <\\u\\ HS ,\\(Iv)(Iw)\\ H -,, 

<\M H 4 \\M H 4 \\iw\\ H .> a 

and the claim follows. □ 



As with Lemma 13.1, these estimates when combined with (115) will give the desired
decay in $N$ provided that there is enough surplus regularity in the high frequency
factors.

The "high-high" interaction, when $b$, $c$ are both high, will also be relatively easy
to handle because there are two high frequency terms in which one can sacrifice
some regularity. (It will turn out that the $a$ term usually has no surplus regularity.)
The "medium-high" or "high-medium" interactions will be more delicate however,
especially if the function associated with the "high" frequency is quite rough (e.g.
$\nabla_{x,t}\Phi$). In this case there may be no surplus regularity on the high frequency factor
to use, but to compensate for this the medium frequency factor will have quite a
bit of surplus regularity. To exploit this we will use the commutator structure, and
specifically the Hölder continuity (or mean-value theorem) estimate

(120)  $m(\xi_1+\xi_2) - m(\xi_2) = O\left(\left(\frac{|\xi_1|}{|\xi_2|}\right)^{\theta} m(\xi_2)\right)$

for any medium frequency $\xi_1$, high frequency $\xi_2$, and $0 \le \theta \le 1$. Morally speaking,
the estimate (120) allows us to transfer up to one full degree of regularity from
the medium frequency factor to the high frequency factor. (If it were not for the
Hölder estimate (120) (which would for instance be the case if $m$ was rough), one
would have to require that $v$ and $w$ can both individually come up with this much
surplus regularity; this is roughly equivalent to the existence of an extra smoothing
estimate of the type mentioned in the introduction.)
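The estimate (120) can be explored numerically in one dimension. The sketch below is an illustration under stated assumptions: it replaces the smooth multiplier $m$ by the non-smooth model $\min(1, (|\xi|/N)^{s-1})$, takes $\xi_1$ in the medium range $10 < |\xi_1| < N/5$ and $\xi_2 > N/10$, and reports the largest observed ratio between the two sides; the grid and the parameter values are arbitrary choices.

```python
def m(r, N, s):
    """Model multiplier: 1 for |r| <= N, (|r|/N)^(s-1) beyond (a non-smooth stand-in)."""
    r = abs(r)
    return 1.0 if r <= N else (r / N) ** (s - 1.0)

def worst_ratio(N, s, theta, steps=200):
    """Largest observed |m(x1+x2) - m(x2)| / ((|x1|/|x2|)^theta * m(x2)) on a 1D grid
    with x1 medium (10 < |x1| < N/5) and x2 high (x2 > N/10)."""
    worst = 0.0
    for i in range(1, steps):
        x1_abs = 10.0 + (N / 5.0 - 10.0) * i / steps
        for sign in (1.0, -1.0):
            x1 = sign * x1_abs
            for j in range(1, steps):
                x2 = N / 10.0 + 20.0 * N * j / steps
                diff = abs(m(x1 + x2, N, s) - m(x2, N, s))
                bound = (x1_abs / x2) ** theta * m(x2, N, s)
                worst = max(worst, diff / bound)
    return worst

for theta in (0.0, 0.5, 1.0):
    print(theta, worst_ratio(N=1000.0, s=0.9, theta=theta))
```

For this model the ratio stays bounded uniformly in $\theta \in [0,1]$, which is the content of (120); the case $\theta = 1$ is the mean-value theorem and smaller $\theta$ follows since $|\xi_1|/|\xi_2| \lesssim 1$ whenever the difference is non-zero.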

We now turn to the specific details for each commutator in turn. 



14. The non-null-form commutators $[I, N_1]$, $[I, N_2]$

Now we prove (106) for $[I, N_1]$ and (107) for $[I, N_2]$. Because of the presence of
the relatively smooth function $A_0$, these commutators can be handled by relatively
simple tools, namely Hölder's inequality, the fractional Leibnitz rule, and some
simple commutator estimates. We will be able to obtain a decay here of $N^{-1/2+}$,
which improves over the claimed decay of $1/N^{(s-1/2)-}$.

We split $D_0\Phi$, $A_0$, and $\Phi$ into low, medium, and high components as in Section 13.
It will suffice to show

(121)  $\left|\int_{t_0}^{T} \langle(D_0\Phi)_a, I(\partial_tA_{0,b}\Phi_c) - (\partial_tIA_{0,b})(I\Phi_c)\rangle\,dt\right| \lesssim N^{-1/2+}$

(122)  $\left|\int_{t_0}^{T} \langle(D_0\Phi)_a, I(A_{0,b}\partial_t\Phi_c) - (IA_{0,b})(I\partial_t\Phi_c)\rangle\,dt\right| \lesssim N^{-1/2+}$

(123)  $\left|\int_{t_0}^{T} \langle\nabla_xIA_{0,a}, I(\Phi_b\nabla_{x,t}\Phi_c) - (I\Phi_b)(I\nabla_{x,t}\Phi_c)\rangle\,dt\right| \lesssim N^{-1/2+}$

for all triples $(a, b, c)$ in Figure 1.

To prove the above commutator estimates, we will use the following bounds on the
factors $D_0\Phi$, $A_0$, $\nabla_{x,t}A_0$, $\nabla_{x,t}\Phi$.

Lemma 14.1 (Spacetime estimates). On the slab $[t_0, T] \times \mathbb{R}^3$, we can place the low,
medium, and high components of $D_0\Phi$, $A_0$, $\nabla_{x,t}A_0$, and $\nabla_{x,t}\Phi$ in the following
spaces:



^It is this ability to use the commutator structure to transfer regularity from smooth factors 
to rough ones which distinguishes the methods here from the frequency truncation method used 
by Bourgain T and later authors. In that method one usually has to rely on "extra smoothing 
estimates" to control the medium-high interactions, but these estimates are usually only available 
if there are no derivatives in the nonlinearity. 
a 


D $a e 


A , a e 






V=m1 q g 




CB(C t ...) 


CB(L 2 ...) 


CB(Lf...) 


CB(C t ° . . .) 


CB(C t ° . . .) 


low 


rOO 


rOO 




JOO 


TOO 


med 








IK 




high 


LI 


r 2 H z x 








high 


N-y*+H- 1/4 + 


N -l/4 +I -l H 7/4+ 








high 




AT- 1 /2+ jff 3/2+ 


n-v 2 +ihI /2+ 


N-V 2 +IH l J 2+ 





77ms /or instance we have d t IA , high G CN- 1 / 2 +~B{L 2 t IH 1 J 2+ ). 



We may discard the $I$ from the above spaces if desired thanks to the trivial embeddings
$IH^a \subset H^a \subset I^{-1}H^a$ for any $a \in \mathbb{R}$.

Proo/. These follow from Lemma OH dMJ), ([Ml), ©, and ([57J. For the low 
frequency terms we use (fT()]) and ([57)) . For the medium and high frequency terms 
we use (57) and (115), as well as the hypothesis $\sqrt{3}/2 < s < 1$ (actually, $5/6 < s < 1$
suffices). Note that the operator $I$ is the identity on medium frequencies, so its
presence there is harmless. □

We can now motivate the numerology behind the decay of $N^{-1/2+}$. Consider the
commutators (121), (123), which are roughly of the form $\int_{[t_0,T]\times\mathbb{R}^3} O(\nabla_{t,x}A_0\,\Phi\,\nabla_{t,x}\Phi)$.
From the above we see that the three factors are in $H^1_x$, $L^2_x$, and $H^1_x$, for medium
frequencies at least. Lemma 13.3 then allows us to estimate the above trilinear
expression. In fact we have about half a derivative to spare; even if we reduced the
regularity of one of the $H^1$ factors to $H^{1/2+}$, we could still use Lemma 13.3. The
idea is to then use (115) to convert this half derivative of room to a $N^{-1/2+}$ factor
in the estimate. The case of (122) is similar; the three factors are now in $L^2_x$,
$H^2_x$, $L^2_x$ but there is still the half of a derivative of surplus regularity which one can
hope to convert to a $N^{-1/2+}$ gain, by using (115) (and in some cases (120)).

Unfortunately, there are a number of minor differences between (121), (122), and
(123) which require separate treatment. To systematize the numerous cases we
shall use a number of tables.

To prove (121) (which is the easiest case), for each of the six cases in Figure 1 we use
the norms and Lemmas indicated in Figure 2. For low frequency interactions we
use Lemma 13.1, while for medium and high frequency interactions we use Lemma
13.3.

The proof of (122) is a little trickier because of the low regularity of $\partial_t \Phi$. We tackle
the six cases in Figure 1 using the spaces and Lemmas in Figure 3 (concatenating
the fifth and sixth cases). Five of the cases are straightforward applications of the

Indeed, one could perhaps improve this factor even further by exploiting the room available
in the time index. Currently we are estimating one factor in $L^2_t$ and the other two in $C^0_t$. By
using Strichartz estimates (cf. Section 12 or [17]) one might be able to sacrifice integrability in
time for regularity in space, which might then be convertible to further gains in $N$.
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a 


b 


c 


D $ a G 


d t A , b G 




Lemma 








CB(C? . . .) 


CB(L 2 ...) 


C7B(C t ° . . .) 




low 


high 


high 


T OO 


7V- 1 /2+ jH -l/2+ 


7V-i/2+^i/2+ 


113.11 


high 


low 


high 


Ll 


T OO 


N -l/2 +H l/2+ 


|13.1| 


high 


high 


low 


Ll 


AT- 1 /2+ jff V2+ 


T OO 

L 10 


113.11 


med /high 


med 


high 


Ll 


H x 


j V _l/2+ jff i/a+ 


11340 


med/high 


high 


high 


Ll 


N- l ' 4 +Hl /i+ 


iV- 1 /4+ ff 3/4+ 


|13.3(i) 


med/high 


high 


med 


Ll 


N . 1/2+H y 2+ 




113.3m 



Figure 2. List of possible cases for (121), the spaces in which
to estimate the three factors, and the Lemma used to obtain the
estimate. In this case the smoothing effect of $I$ or the commutator
structure does not need to be exploited. Observe that in all six
cases the product of the three norms is $O(N^{-1/2+})$.



a 


b 


c 


D $ a G 
CB(C? . . .) 


A Q . b e 
CB(L 2 ...) 


CB(C? . . .) 


Lemma 


low 


high 


high 


T OO 

^10 




7V- 1 /2+iJ-l/2+ 


|13.1| 


high 


low 


high 


Ll 


T OO 

^10 


I-'Ll 


|13.2| 


high 


high 


low 


Ll 


n- x i 2+ hI /2+ 


T OO 

^10 


113.11 


med/high 
med/high 


med 
high 


high 

med/high 


Ll 
Ll 


ll% 

n-v 2 +ihI /2+ 


I-'Ll 
I-'Ll 


11200 
113.3th) 



Figure 3. List of possible cases for (122), the norms in which
to estimate the three factors, and a very brief description of the
techniques used in the estimate. When Lemma 13.1 or Lemma
13.3 is used, the product of the three norms is $O(N^{-1/2+})$; in the
other two cases the decay in $N$ comes instead from commutator
estimates.

Lemmas of the previous section and will not be discussed further. The one case
which is interesting is Case 4, when $a$ is medium or high, $b$ is medium, and $c$ is
high. By Hölder in time it suffices to show the commutator estimate

$$|\langle u, I(vw) - v\,Iw \rangle| \lesssim N^{-1/2+} \|u\|_{L^2_x} \|v\|_{H^2_x} \|Iw\|_{L^2_x}$$

where $v$ has medium frequency and $w$ has high frequency. (Note that Lemma 13.2
is not available to us here because $v$ is not low frequency.) Since all the norms on
the right-hand side are $L^2_x$ based, we may assume that the Fourier transforms of
$u$, $v$, $w$ are non-negative. By (120) with exponent $1/2-$ we then have

$$|\langle u, I(vw) - v\,Iw \rangle| \lesssim |\langle u, (|\nabla_x|^{1/2-} v)(|\nabla_x|^{-1/2+} Iw) \rangle|.$$

But from (115) we have $\| |\nabla_x|^{-1/2+} Iw \|_{L^2_x} \lesssim N^{-1/2+} \|Iw\|_{L^2_x}$, and from Sobolev
embedding we have $\| |\nabla_x|^{1/2-} v \|_{L^\infty_x} \lesssim \|v\|_{H^2_x}$. The claim follows.
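
The last two steps can be unpacked as follows. This is only a sketch under stated assumptions: it takes "high frequency" to mean that the Fourier transform of $Iw$ is supported in $|\xi| \gtrsim N$ (which is how (115) is being used here), writes the exponents $-1/2+$ and $1/2-$ as $-1/2+\epsilon$ and $1/2-\epsilon$ for a small $\epsilon > 0$, and uses the shorthand $g := Iw$, which is not notation from the text.

```latex
% Sketch only; g := Iw, with \hat g assumed supported in |\xi| \gtrsim N, and 0 < \epsilon < 1/2.
% Step 1 (Plancherel): the negative power of |\xi| is largest at the bottom of the support.
\begin{align*}
\bigl\| |\nabla_x|^{-1/2+\epsilon} g \bigr\|_{L^2_x}^2
  \sim \int_{|\xi| \gtrsim N} |\xi|^{-1+2\epsilon}\, |\hat g(\xi)|^2 \, d\xi
  \lesssim N^{-1+2\epsilon}\, \| g \|_{L^2_x}^2 .
\end{align*}
% Step 2 (H\"older in x, then Sobolev in three dimensions for the factor containing v):
\begin{align*}
\bigl| \langle u, (|\nabla_x|^{1/2-\epsilon} v)(|\nabla_x|^{-1/2+\epsilon} g) \rangle \bigr|
  &\le \|u\|_{L^2_x}\,
       \bigl\| |\nabla_x|^{1/2-\epsilon} v \bigr\|_{L^\infty_x}\,
       \bigl\| |\nabla_x|^{-1/2+\epsilon} g \bigr\|_{L^2_x} \\
  &\lesssim N^{-1/2+\epsilon}\, \|u\|_{L^2_x}\, \|v\|_{H^2_x}\, \|g\|_{L^2_x} .
\end{align*}
```

The Sobolev step only needs $2 - (1/2 - \epsilon) > 3/2$, which holds for every $\epsilon > 0$.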

Finally, we prove (123). We now argue as before, except that we now must reshuffle
the six cases of Figure 1 because $\nabla_x A_0$ has different behaviour at medium and
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5 


c 


CB(L 2 . . .) 


CB(C t ° . . .) 


CB(C t ° . . .) 






low 


high 


high 


T OO 

^10 




N -l/2+ H -l/2+ 




113.11 


high 


low 


high 


N -X/2 +H l/2+ 


T OO 






|13.1| 


high 


high 


low 


n~ 1 / 2 +hI /2+ 




T OO 




113.11 


med 


med 


high 






N-V 2 +H- 1/2+ 




U34,i) 


high 


med 


high 


n~ x / 2 +hI /2+ 








|l3.3rh) 


med 


high 


med/high 


Hi 


N-V*+IH l J 2+ 






|l3.3rh) 


high 


high 


med/high 


I- l H x and 


^-1/4+^3/4+ 




and 


|l3.3fiii) 


high 


high 


med/high 


7V- 1 /4+ jff 3/4+ 




/^L 2 




Il3.3riii) 



Figure 4. List of possible cases for (123), the norms in which
to estimate the three factors, and a very brief description of the
techniques used in the estimate. In all cases the product of the
three norms is $O(N^{-1/2+})$.



high frequencies. Similar to the previous section, the six cases in Figure 1 can now
be handled using the spaces and Lemmas in Figure 4.



15. The null form commutator $[I, \mathbf{N}_0]$

We now prove the most difficult commutator estimate, namely the estimate (106)
for the null form commutator $[I, \mathbf{N}_0]$. Here we shall need the full strength of the $H^{s,b}$
spaces, and in particular the fact that the "$b$" index is $s-$ and not just $1/2+$. This
is the one case where we will only be able to obtain a decay of $\frac{1}{N^{(s-1/2)-}}$ instead of
$N^{-1/2+}$. Unsurprisingly we shall also need null form estimates for these spaces. The
time localization to the interval $[t_0, T]$ has been ignored up until now (because we
have always done a Hölder in time anyway) but is now a major technical nuisance,
as multiplication by sharp time cutoffs destroys the "$b$" index of regularity.

The major difficulty with this estimate is with the $\nabla^{-1} Q(\Phi, \Phi)$ component of $\mathbf{N}_0$,
because there is no extra regularity in the $s$ index in any of the factors to be
sacrificed to obtain the decay in $N$. However, there is some extra regularity in the
$b$ index which can (after much work) be exploited as a substitute. Informally, the
strategy is as follows. If at least one of the factors is low or medium frequency then
one can obtain the decay in $N$ through commutator estimates. Now suppose all
factors are high frequency. We look at the spacetime Fourier transform of all three
factors. If at least one of them is far away ($\gtrsim N$) from the light cone then one can
exploit the additional room in the $b$ index to obtain the gain. The only remaining
possibility is when all three frequencies are close to the light cone, but this means
that their frequencies must be close to parallel (since they must add up to zero),
at which point one can obtain some gain from the null structure.
It will suffice to prove that

$$\left| \int_{\mathbb{R}} 1_{[t_0,T]}(t) \, \langle (D_0 \Phi)_a, \; I \mathbf{N}_0(\Phi_b, \Phi_c) - \mathbf{N}_0(I\Phi_b, I\Phi_c) \rangle \, dt \right| \lesssim \frac{1}{N^{(s-1/2)-}}$$

for all $a, b, c$ as in Figure 1.



We first deal with the low frequency terms, when one of $a, b, c$ is low frequency. For
these terms we shall abandon the null structure and just prove

$$\left| \int_{\mathbb{R}} 1_{[t_0,T]}(t) \, \langle (D_0 \Phi)_a, \; I(\Phi_b \nabla_x \Phi_c) - (I\Phi_b) \nabla_x (I\Phi_c) \rangle \, dt \right| \lesssim N^{-1/2+} \lesssim \frac{1}{N^{(s-1/2)-}}.$$

We argue using the following table; the estimates on $D_0 \Phi$ and $\Phi$ come from Lemma
11.2 and (IM|) respectively. Note that the arguments are almost identical to those
in the previous section.



a 


b 


c 


(D $) a e 
CB(C° . . .) 


CB(C t ° . . .) 


CB(C t ° . . .) 


Lemma 


low 


high 


high 


T OO 

^10 


N - 1/2+H i/2 + 


N -l/2+g-l/2+ 


|13.1| 


high 


low 


high 


Ll 


T OO 




|13.2| 


high 


high 


low 


N-^+H- 1/2+ 




T OO 

^10 


113.11 



Now suppose that $a, b, c$ are in one of the three remaining cases in Figure 1. Since we
have dealt with all low frequency issues there will no longer be a need to distinguish
between $|\xi|$ and $\langle \xi \rangle$, or between homogeneous and inhomogeneous Sobolev norms,
etc.



The Sobolev estimates which sufficed for all the other commutators will not work
here, and we must exploit the null structure. Since $[t_0, T]$ is contained inside
$[t_0 - \delta/2, t_0 + \delta/2]$, it will suffice from (94), Lemma 11.2 to prove the global spacetime
estimate

$$\left| \int_{\mathbb{R}} 1_{[t_0,T]}(t) \, \langle u_a, \; I \mathbf{N}_0(v_b, w_c) - \mathbf{N}_0(I v_b, I w_c) \rangle \, dt \right| \lesssim \frac{1}{N^{(s-1/2)-}} \|u_a\|_{H^{0,s-}} \|I v_b\|_{H^{1,s-}} \|I w_c\|_{H^{1,s-}} \tag{124}$$

where $u_a$, $v_b$, $w_c$ are supported in the Fourier regions corresponding to $a, b, c$.

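Now that we are working globally in spacetime we are able to use the spacetime
Fourier transform. We may assume that the spacetime Fourier transforms of $u_a$,
$v_b$, $w_c$ are all real and non-negative.

From (56) we have

$$|\widetilde{I \mathbf{N}_0(v_b, w_c)}(\tau, \xi)| \lesssim \int_{\xi = \xi_1 + \xi_2; \; \tau = \tau_1 + \tau_2} m(\xi_1 + \xi_2) \left( \frac{|\xi_1 \wedge \xi_2|}{|\xi_1|} + \frac{|\xi_1 \wedge \xi_2|}{|\xi_1 + \xi_2|} \right) \tilde v_b(\xi_1, \tau_1) \, \tilde w_c(\xi_2, \tau_2)$$

and

$$|\widetilde{\mathbf{N}_0(I v_b, I w_c)}(\tau, \xi)| \lesssim \int_{\xi = \xi_1 + \xi_2; \; \tau = \tau_1 + \tau_2} m(\xi_1) m(\xi_2) \left( \frac{|\xi_1 \wedge \xi_2|}{|\xi_1|} + \frac{|\xi_1 \wedge \xi_2|}{|\xi_1 + \xi_2|} \right) \tilde v_b(\xi_1, \tau_1) \, \tilde w_c(\xi_2, \tau_2)$$

where $\xi_1 \wedge \xi_2$ is the anti-symmetric tensor with components $\xi_1^j \xi_2^k - \xi_1^k \xi_2^j$. Moreover,
we have

$$|(I \mathbf{N}_0(v_b, w_c) - \mathbf{N}_0(I v_b, I w_c))^{\sim}(\tau, \xi)| \lesssim \int_{\xi = \xi_1 + \xi_2; \; \tau = \tau_1 + \tau_2} |m(\xi_1 + \xi_2) - m(\xi_1) m(\xi_2)| \left( \frac{|\xi_1 \wedge \xi_2|}{|\xi_1|} + \frac{|\xi_1 \wedge \xi_2|}{|\xi_1 + \xi_2|} \right) \tilde v_b(\xi_1, \tau_1) \, \tilde w_c(\xi_2, \tau_2).$$

Also, we have

$$|\hat 1_{[t_0,T]}(\tau)| \lesssim \langle \tau \rangle^{-1}.$$

By Parseval, we can therefore estimate the left-hand side of (124) by

$$\int_* \omega(\xi_1, \xi_2) \, |\xi_1 \wedge \xi_2| \, (|\xi_0|^{-1} + |\xi_1|^{-1}) \, \frac{\tilde u_a(\xi_0, \tau_0) \, \widetilde{I v_b}(\xi_1, \tau_1) \, \widetilde{I w_c}(\xi_2, \tau_2)}{\langle \tau_0 + \tau_1 + \tau_2 \rangle}$$

where $\int_*$ denotes an integration over all $(\tau_0, \xi_0)$, $(\tau_1, \xi_1)$, $(\tau_2, \xi_2) \in \mathbb{R} \times \mathbb{R}^3$ with
$\xi_0 + \xi_1 + \xi_2 = 0$, and $\omega(\xi_1, \xi_2)$ is the symbol

$$\omega(\xi_1, \xi_2) := \frac{|m(\xi_1 + \xi_2) - m(\xi_1) m(\xi_2)|}{m(\xi_1) m(\xi_2)}.$$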

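We perform a Littlewood-Paley decomposition. Let $N_j$, $j = 0, 1, 2$, range over
dyadic numbers, with $N_j$ the nearest dyadic number to $|\xi_j|$, and let $\{\min, \mathrm{med}, \max\}$
be a permutation of $\{0, 1, 2\}$ such that $N_{\min} \le N_{\mathrm{med}} \le N_{\max}$. Observe that
$N_{\mathrm{med}} \sim N_{\max}$ since $\xi_0 + \xi_1 + \xi_2 = 0$, and that $N_{\min} \gtrsim 1$ since none of $a, b, c$ are
low. Also we have $N_{\max} \gtrsim N$ since at least one of $b, c$ is high. Clearly we have
$|\xi_0|^{-1} + |\xi_1|^{-1} \lesssim N_{\min}^{-1}$. Also write $\lambda_i := \langle |\xi_i| - |\tau_i| \rangle$ for $i = 0, 1, 2$. Therefore, from
these notations and the definition of the $H^{s,b}$ norms, it will thus suffice, by the
Cauchy-Schwarz inequality, to prove

$$\int_{*; \; N_{\min} \gtrsim 1, \; N_{\max} \sim N_{\mathrm{med}} \gtrsim N} \frac{\omega(\xi_1, \xi_2) \, |\xi_1 \wedge \xi_2| \, \prod_{j=0}^2 F_j(\xi_j, \tau_j)}{N_{\min} N_1 N_2 \, \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle} \lesssim \frac{1}{N^{(s-1/2)-}} \prod_{j=0}^2 \|F_j\|_{L^2_\tau L^2_\xi} \tag{125}$$

for all non-negative functions $F_j$.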

Next, we recall the standard estimate for the null form symbol $|\xi_1 \wedge \xi_2|$.
Lemma 15.1 (Symbol bound). We have

$$|\xi_1 \wedge \xi_2| \lesssim N_0^{1/2} N_1^{1/2} N_2^{1/2} \left( \langle \tau_0 + \tau_1 + \tau_2 \rangle + \lambda_0 + \lambda_1 + \lambda_2 \right)^{1/2}.$$

Proof. See [23, Proposition 1] or [26, Proposition 8.1]. □
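From this lemma we see that to prove (125) it will suffice to show that the expressions

$$\int_{*; \; N_{\min} \gtrsim 1, \; N_{\max} \sim N_{\mathrm{med}} \gtrsim N} \frac{\omega(\xi_1, \xi_2) \, N_0^{1/2} \prod_{j=0}^2 F_j(\xi_j, \tau_j)}{N_{\min} N_1^{1/2} N_2^{1/2} \, \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle^{1/2}} \tag{126}$$

and

$$\int_{*; \; N_{\min} \gtrsim 1, \; N_{\max} \sim N_{\mathrm{med}} \gtrsim N} \frac{\omega(\xi_1, \xi_2) \, N_0^{1/2} \prod_{j=0}^2 F_j(\xi_j, \tau_j)}{N_{\min} N_1^{1/2} N_2^{1/2} \, \lambda_j^{(s-1/2)-} \lambda_k^{s-} \lambda_l^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle} \tag{127}$$

are both $O\big( \frac{1}{N^{(s-1/2)-}} \big) \prod_{j=0}^2 \|F_j\|_{L^2_\tau L^2_\xi}$ for all permutations $\{j, k, l\}$ of $\{0, 1, 2\}$.
In fact, we shall study several cases and show that all of them (except one) are
$O(N^{-1/2+}) \prod_{j=0}^2 \|F_j\|_{L^2_\tau L^2_\xi}$.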
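
The reduction from (125) to (126) and (127) rests on the subadditivity of the square root. The following sketch spells this out, assuming that the weight in the denominator of (125) is $\langle \tau_0 + \tau_1 + \tau_2 \rangle \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-}$; the shorthand $T := \langle \tau_0 + \tau_1 + \tau_2 \rangle$ is introduced here only for readability.

```latex
% Sketch only; T := \langle \tau_0 + \tau_1 + \tau_2 \rangle is shorthand not used in the text.
% Subadditivity of the square root on non-negative numbers:
\begin{align*}
\bigl( T + \lambda_0 + \lambda_1 + \lambda_2 \bigr)^{1/2}
  \le T^{1/2} + \lambda_0^{1/2} + \lambda_1^{1/2} + \lambda_2^{1/2} .
\end{align*}
% Dividing by the weight T \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-}:
\begin{align*}
\frac{T^{1/2}}{T\, \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-}}
  &= \frac{1}{T^{1/2}\, \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-}}
  && \text{(the weight in (126))}, \\
\frac{\lambda_j^{1/2}}{T\, \lambda_j^{s-} \lambda_k^{s-} \lambda_l^{s-}}
  &= \frac{1}{T\, \lambda_j^{(s-1/2)-} \lambda_k^{s-} \lambda_l^{s-}}
  && \text{(the weight in (127))} .
\end{align*}
```

The first term costs half a power of the time-cutoff weight, while the other three cost half a power of one of the $\lambda_j$, which is why (127) has to be checked for every choice of $j$.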

For any $\alpha > 0$, let $a_\alpha(t)$ denote the Fourier transform of $\langle \tau \rangle^{-\alpha}$; this function
appears implicitly in (126), (127) for $\alpha = 1/2, 1$. We shall need the following $L^p_t$
estimates:

Lemma 15.2. Let $0 < \alpha \le 1$. Then $a_\alpha \in L^p_t$ for all $p < 1/(1 - \alpha)$.

Proof. We dyadically decompose $a_\alpha$ as $\sum_{k \ge 0} 2^{-k\alpha} \hat\psi_k$, where each $\psi_k$ is a bump
function adapted to $[-2^k, 2^k]$ (possibly depending on $\alpha$). Observe that $\hat\psi_k(t) =
O(2^k \langle 2^k t \rangle^{-M})$ for any large $M > 0$. If one sums this, we see that $a_\alpha(t)$ is rapidly
decreasing for $|t| \ge 1$. For $|t| \le 1$, we have $a_\alpha(t) = O(|t|^{\alpha - 1})$ if $\alpha < 1$ and
$a_\alpha(t) = O(|\log |t||)$ if $\alpha = 1$. The claim follows. □
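
The final step of this proof is an elementary integrability check near $t = 0$, sketched below. It assumes only the two pointwise bounds just stated for $|t| \le 1$ and takes $p \ge 1$; the region $|t| \ge 1$ contributes a finite amount for every $p$ because of the rapid decay.

```latex
% Sketch only: integrability near t = 0 of the pointwise bounds on a_\alpha.
% Case 0 < \alpha < 1:
\begin{align*}
\int_{|t| \le 1} |t|^{(\alpha - 1)p} \, dt
  = 2 \int_0^1 t^{-(1-\alpha)p} \, dt
  = \frac{2}{1 - (1-\alpha)p}
  \quad \text{if } (1-\alpha)p < 1, \text{ and } = \infty \text{ otherwise.}
\end{align*}
% Case \alpha = 1 (substitute t = e^{-u}):
\begin{align*}
\int_{|t| \le 1} \bigl| \log |t| \bigr|^{p} \, dt
  = 2 \int_0^\infty u^{p} e^{-u} \, du
  = 2\, \Gamma(p+1) < \infty
  \quad \text{for every } p < \infty .
\end{align*}
```

So the condition is exactly $p < 1/(1-\alpha)$: for $\alpha = 1/2$ this means $p < 2$, and for $\alpha = 1$ every finite $p$ is allowed.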

We now estimate (126), (127) separately in the cases $\min = 0$ and $\min \ne 0$,
giving four cases. In all cases except Case 4, we will be able to obtain a decay of
$O(N^{-1/2+})$.

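Case 1: The bounding of (126) when $\min = 0$.

We use the crude bound (valid for all $\xi_0, \xi_1, \xi_2$)

$$\omega(\xi_1, \xi_2) \lesssim \frac{1}{m(N_1) m(N_2)} \lesssim \frac{1}{m(N_{\max})^2} \lesssim N^{-1/2+} N_{\max}^{1/2-} \tag{128}$$

to bound (126) by

$$N^{-1/2+} \int_{*; \; N_{\min} \gtrsim 1, \; N_{\max} \sim N_{\mathrm{med}} \gtrsim N} \frac{\prod_{j=0}^2 F_j(\xi_j, \tau_j)}{N_0^{1/2+} N_1^{1/2} \, \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle^{1/2}};$$

undoing the Fourier transform, it thus suffices to show

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} u_0 u_1 u_2 \, a_{1/2} \, dx \, dt \right| \lesssim \|u_0\|_{H^{1/2+,s-}} \|u_1\|_{H^{1/2,s-}} \|u_2\|_{H^{0,s-}}.$$

But this follows from the Strichartz estimates $H^{1/2+,s-} \subset L^{4+}_t L^4_x$, together with
the embedding $H^{0,s-} \subset L^{\infty-}_t L^2_x$, which follows from interpolation between $H^{0,0} =
L^2_t L^2_x$ and $H^{0,1/2+} \subset C^0_t L^2_x$.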
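
The exponent bookkeeping behind the crude bound (128) can be sketched as follows. This assumes the usual I-method convention for the multiplier, namely $m(\xi) = 1$ for $|\xi| \le N$ and $m(\xi) = (|\xi|/N)^{s-1}$ for $|\xi| \ge 2N$, with $m$ radial and non-increasing; that definition is not restated in this section, so it should be read as an assumption here. The small parameter $\epsilon$ stands for the "$+$" and "$-$" in the exponents.

```latex
% Sketch only. Assumption (standard I-method convention, not restated in this section):
%   m(\xi) = 1 for |\xi| \le N,  m(\xi) = (|\xi|/N)^{s-1} for |\xi| \ge 2N,  m radial, non-increasing.
% Since N_1, N_2 \lesssim N_{\max} and N_{\max} \gtrsim N:
\begin{align*}
\frac{1}{m(N_1)\, m(N_2)}
  \lesssim \frac{1}{m(N_{\max})^2}
  \sim \Bigl( \frac{N_{\max}}{N} \Bigr)^{2(1-s)} .
\end{align*}
% The exponent satisfies 2(1-s) < 1/2 exactly when s > 3/4, so for such s
\begin{align*}
\Bigl( \frac{N_{\max}}{N} \Bigr)^{2(1-s)}
  \lesssim \Bigl( \frac{N_{\max}}{N} \Bigr)^{1/2 - \epsilon}
  = N^{-1/2+\epsilon}\, N_{\max}^{1/2-\epsilon}
  \qquad \text{for } 0 < \epsilon \le 2s - \tfrac32 .
\end{align*}
```

In particular the hypothesis $s > 3/4$ is what allows the loss from the two factors of $m^{-1}$ to be absorbed into half a derivative.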
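Case 2: The bounding of (126) when $\min \ne 0$.

By symmetry we may assume $\min = 1$. In this case we can bound

$$\omega(\xi_1, \xi_2) \lesssim \frac{m(N_0) + m(N_1) m(N_2)}{m(N_1) m(N_2)} \lesssim \frac{m(N_{\max}) + m(N_{\min}) m(N_{\max})}{m(N_{\min}) m(N_{\max})} \lesssim m(N_{\min})^{-1} \lesssim N^{-1/2+} N_{\min}^{1/2-}.$$

We can thus estimate (126) by

$$N^{-1/2+} \int_{*; \; N_{\min} \gtrsim 1, \; N_{\max} \sim N_{\mathrm{med}} \gtrsim N} \frac{\prod_{j=0}^2 F_j(\xi_j, \tau_j)}{N_1^{1+} \, \lambda_0^{s-} \lambda_1^{s-} \lambda_2^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle^{1/2}}$$

so it would suffice to show that

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} a_{1/2} \, u_0 u_1 u_2 \, dx \, dt \right| \lesssim \|u_0\|_{H^{0,s-}} \|u_1\|_{H^{1+,s-}} \|u_2\|_{H^{0,s-}}.$$

But this follows from the Strichartz embeddings $H^{0,s-} \subset L^{\infty-}_t L^2_x$ and $H^{1+,s-} \subset
L^{2+}_t L^\infty_x$ (from (58)), and Lemma 15.2.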

Case 3: The bounding of (127) when $\min = 0$.

Since the $j = 1$ and $j = 2$ cases are symmetric, we may assume that $j = 0$ or $j = 1$.
We again use (128) to bound (127) by

$$N^{-1/2+} \int_{*; \; N_{\min} \gtrsim 1, \; N_{\max} \sim N_{\mathrm{med}} \gtrsim N} \frac{\prod_{i=0}^2 F_i(\xi_i, \tau_i)}{N_0^{1/2} N_2^{1/2+} \, \lambda_j^{(s-1/2)-} \lambda_k^{s-} \lambda_l^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle}.$$

Undoing the Fourier transform and considering the $j = 0$ and $j = 1$ cases separately,
it thus suffices to show

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} u_0 u_1 u_2 \, a_1 \, dx \, dt \right| \lesssim \|u_0\|_{H^{1/2,(s-1/2)-}} \|u_1\|_{H^{0,s-}} \|u_2\|_{H^{1/2+,s-}}$$

and

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} u_0 u_1 u_2 \, a_1 \, dx \, dt \right| \lesssim \|u_0\|_{H^{1/2,s-}} \|u_1\|_{H^{0,(s-1/2)-}} \|u_2\|_{H^{1/2+,s-}}.$$

For the latter estimate, we place $u_0$ in $L^4_t L^4_x$, $u_1$ in $L^2_t L^2_x$, and $u_2$ in $L^{4+}_t L^4_x$, using
(58) and Lemma 15.2. Now we turn to the former. To avoid excessive notation,
we will pretend that $a_1$ is compactly supported rather than rapidly decreasing; the
rapidly decreasing case can then be handled by a routine dyadic decomposition.
We may now assume that $u_0, u_1, u_2$ are compactly supported in time. From (61)
we have

$$\|u_1 u_2\|_{H^{-1/2,0}} \lesssim \|u_1\|_{H^{0,s-}} \|u_2\|_{H^{1/2+,s-}}$$

so it suffices to show that

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} u v \, a_1 \, dx \, dt \right| \lesssim \|u\|_{H^{1/2,(s-1/2)-}} \|v\|_{H^{-1/2,0}}.$$

But this is easily established from Plancherel's theorem.
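
For the second ("latter") of the two Case 3 estimates, the Hölder exponents can be checked as follows. This is a sketch assuming that the three factors are placed in $L^4_t L^4_x$, $L^2_t L^2_x$ and $L^{4+}_t L^4_x$ respectively, and that $a_1$ is put in $L^p_t$ with $p$ large; the parameters $\delta$, $q$ and $p$ below are introduced only for this check.

```latex
% Sketch only; the space assignments are the ones assumed in the lead-in.
% Write 1/q = 1/4 - \delta for a small \delta > 0 (so L^q_t plays the role of L^{4+}_t), and p = 1/\delta.
% H\"older in x:  1/4 + 1/2 + 1/4 = 1.
% H\"older in t:  1/4 + 1/2 + (1/4 - \delta) + \delta = 1.
\begin{align*}
\Bigl| \int_{\mathbb{R} \times \mathbb{R}^3} u_0 u_1 u_2 \, a_1 \, dx \, dt \Bigr|
  \le \|u_0\|_{L^4_t L^4_x}\, \|u_1\|_{L^2_t L^2_x}\, \|u_2\|_{L^{q}_t L^4_x}\, \|a_1\|_{L^{p}_t} .
\end{align*}
% Lemma 15.2 with \alpha = 1 gives a_1 \in L^p_t for every p < \infty, so the last factor is finite.
```

The slack $\delta$ in the time exponent of $u_2$ is exactly what pays for the logarithmic singularity of $a_1$ at $t = 0$.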
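Case 4: The bounding of (127) when $\min \ne 0$.

By symmetry we may take $\min = 1$. We may assume that $N_1 \ll N_0, N_2$ since
otherwise we could take $\min = 0$ and be in Case 3. We can then bound (127) by

$$\int_{*; \; 1 \lesssim N_1 \ll N_0 \sim N_2} \frac{\omega(\xi_1, \xi_2) \prod_{i=0}^2 F_i(\xi_i, \tau_i)}{N_1^{3/2} \, \lambda_j^{(s-1/2)-} \lambda_k^{s-} \lambda_l^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle}.$$

Using Cauchy-Schwarz, we may fix $N_0$ and $N_2$, but we will still need to sum in $N_1$.

There are two subcases.

• Case 4.a: $N_1 \ll N$.

We will take $j = 1$, which is the most difficult case; the other cases are
treated similarly or are easier. In this case $m(\xi_1) = 1$, and by the mean
value theorem

$$|\omega(\xi_1, \xi_2)| \lesssim \frac{|\xi_1|}{|\xi_2|}. \tag{129}$$

Therefore we are reduced to proving that

$$\int_* \frac{\prod_{i=0}^2 F_i(\xi_i, \tau_i)}{N_2 \, |\xi_0 + \xi_2|^{1/2} \, \lambda_0^{s-} \lambda_1^{(s-1/2)-} \lambda_2^{s-} \, \langle \tau_0 + \tau_1 + \tau_2 \rangle} \lesssim \frac{1}{N^{1/2-}} \prod_{i=0}^2 \|F_i\|_{L^2_\tau L^2_\xi} \tag{130}$$

since $|\xi_1| = |\xi_0 + \xi_2|$. In view of the assumptions, it is enough to prove

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} a_1 \, D^{-1/2}(u_0 u_2) \, u_1 \, dx \, dt \right| \lesssim \|u_0\|_{H^{1/4+,s-}} \|u_1\|_{H^{0,(s-1/2)-}} \|u_2\|_{H^{1/4,s-}}. \tag{131}$$

But this inequality follows from Lemma 15.2, Hölder's inequality and the
embeddings $H^{0,(s-1/2)-} \subset L^{2+}_t L^2_x$, $D^{-1/2}(H^{1/4+,s-} \cdot H^{1/4,s-}) \subset L^2_t L^2_x$. The
former embedding follows from interpolation between the embeddings $H^{0,0} =
L^2_t L^2_x$ and $H^{0,1/2+} \subset C^0_t L^2_x$; the latter is the $H^{s,b}$ version of the bilinear
Strichartz estimates (see e.g. [27] or [28, Corollary 1.4]).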
• Case 4.b: $N_1 \gtrsim N$.

Again we will take the most difficult case $j = 1$; the other cases are
treated similarly or are easier. In this case, if $\beta \ge 1 - s$, then we have

(132) <(^) 1_S 

where the exponent will dominate the various — and + exponents 

appearing elsewhere. 

We seek $\beta \ge 1 - s$ as large as possible such that
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(133) 



,r»+ 



f (Ex\ p — "i n 1 =o r nt J ^; <r 1 tt 2 IIP II 

J*;l<JVi<Af ~Af 2 \ N J 3 (--A)- ~ ]V?= 1 lj=0 Irjllifij 

JV^A^A^ 2; A|-<r D +Ti+T 2 ) 

for fixed $N_1$; we use the $N_1^{0+}$ factor to sum in $N_1$.
It is enough to find $\beta$ as large as possible such that

$$\left| \int_{\mathbb{R} \times \mathbb{R}^3} a_1 \, u_0 \, (P_{N_1} u_1) \, u_2 \, dx \, dt \right| \lesssim N_1^{0+} \|u_0\|_{H^{0,s-}} \|u_1\|_{H^{(3/2-\beta)-,(s-1/2)-}} \|u_2\|_{H^{0,s-}}; \tag{134}$$

the exponent in $N_1$ can safely absorb the various $N_1^{0+}$ factors present
with enough room to spare to still allow a summation in $N_1$.

But if $\beta < s - \frac{1}{2}$ then (134) comes from Hölder's inequality, the embedding
$H^{0,s-} \subset L^4_t L^2_x$ and the embedding $N_1^{0+} H^{(3/2-\beta)-,(s-1/2)-} \subset L^{2+}_t L^\infty_x$. The
former comes from the interpolation of $H^{0,s-} \subset L^2_t L^2_x$ and $H^{0,s-} \subset L^\infty_t L^2_x$.
The latter comes from the interpolation of
$\|P_{N_1} v\|_{L^{2+}_t L^\infty_x} \lesssim N_1^{0+} \|P_{N_1} v\|_{L^{2+}_t L^{\infty-}_x} \lesssim N_1^{0+} \|v\|_{H^{1-,1/2+}}$ and the
Sobolev embedding $H^{3/2+,0} \subset L^2_t L^\infty_x$. Therefore, by choosing $\beta = (s - \frac{1}{2})-$,
we get a decay of $O\big( \frac{1}{N^{(s-1/2)-}} \big)$ in (127).
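
The two constraints on the exponent in Case 4.b are compatible only in a restricted range of $s$, as the short check below shows. It assumes that the symbol written here as $\beta$ is the exponent being optimised in this subcase, subject to $\beta \ge 1 - s$ and $\beta < s - 1/2$; the decimal values are rounded.

```latex
% Sketch only: compatibility of the two constraints 1 - s \le \beta < s - 1/2.
\begin{align*}
1 - s < s - \tfrac12
  \iff 2s > \tfrac32
  \iff s > \tfrac34 .
\end{align*}
% Size of the resulting decay exponent s - 1/2 at the ends of the range \sqrt{3}/2 < s < 1:
\begin{align*}
s = \tfrac{\sqrt3}{2} \approx 0.866 \;\Longrightarrow\; s - \tfrac12 \approx 0.366,
\qquad
s \to 1 \;\Longrightarrow\; s - \tfrac12 \to \tfrac12 .
\end{align*}
```

So for every $s$ in the range considered, the decay obtained in this subcase is strictly weaker than the $N^{-1/2+}$ available in the other cases.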



Appendix A. Lack of extra smoothing 



In the study of global well-posedness for the cubic nonlinear wave equation (NLW)
below the energy norm, see [17], an important role is played by the extra smoothing
phenomenon, that the nonlinear component of the evolution is in the energy class
even if the initial data is not. For instance, if $\Phi \in C^0_t H^s_x \cap C^1_t H^{s-1}_x$ solves (NLW)
for some $3/4 < s < 1$, then we have $\Phi - \Phi_{\mathrm{lin}} \in C^0_t H^1_x \cap C^1_t L^2_x$, where $\Phi_{\mathrm{lin}}$ is the
solution to the free wave equation $\Box \Phi_{\mathrm{lin}} = 0$ with initial data $\Phi[0]$. See [17] for
details. The reliance on such a smoothing phenomenon is not unique to the work
[17], but is rather an important feature of all applications of the truncation method
put forward by Bourgain (e.g. pQ).

In this appendix we show why this extra smoothing phenomenon fails in a certain
quantitative sense for the system (MKG-CG). Specifically, we show

Theorem A.1 (Lack of extra smoothing). Let $3/4 < s < 1$, and let $\varepsilon > 0$ be
sufficiently small depending on $s$. Then for any $M > 0$ there exists a solution
$\Phi \in C^0_t H^s_x \cap C^1_t H^{s-1}_x$ to (MKG-CG) with $\|\Phi[0]\|_{[H^s]} \le \varepsilon$ such that

$$\|\phi[1] - \phi_{\mathrm{lin}}[1]\|_{[H^1]} \ge M,$$

where $\phi_{\mathrm{lin}}$ is the solution to the free wave equation $\Box \phi_{\mathrm{lin}} = 0$ with initial data $\phi[0]$.



Notice that this implies that $\beta > 1 - s$, since $s > \frac{3}{4}$.
Proof. Suppose this were not the case. Then we could find $3/4 < s < 1$ and
arbitrarily small $\varepsilon$ such that

$$\|\phi[1] - \phi_{\mathrm{lin}}[1]\|_{[H^1]} \lesssim 1$$

whenever $\Phi$ solved (MKG-CG) with $\|\Phi[0]\|_{[H^s]} \le \varepsilon$, where we allow the implicit
constant to depend on $s$ and $\varepsilon$. By Green's theorem for the d'Alembertian we thus
see that

$$\left| \int_0^1 \int_{\mathbb{R}^3} \Box \phi(t, x) \, f_{\mathrm{lin}}(t, x) \, dx \, dt \right| \lesssim 1 \tag{135}$$

whenever $f_{\mathrm{lin}}$ is a solution to the free wave equation with $\|f[1]\|_{[L^2]} \sim \|f[0]\|_{[L^2]} \lesssim 1$.
As we shall see, it is the component $-2i(PA) \cdot \nabla_x \phi$, with the derivative of $\phi$ present,
which causes (135) to fail, even with the presence of null structure.

We now work on the spacetime slab $[0, 1] \times \mathbb{R}^3$. Modifying slightly the proof of
Theorem 5.1 in Section 9, we see (if $\varepsilon$ is small enough) that $\Phi$ will obey the estimates

$$\|\Phi\|_{H^{s,3/4+}_{[0,1]}} + \|\nabla_{x,t} \Phi\|_{H^{s-1,3/4+}_{[0,1]}} + \|\nabla_{x,t} A_0\|_{L^2_t H^{1/2+}_x} \lesssim \varepsilon.$$

In fact, from (74) we have

$$\|\nabla_{x,t} A_0\|_{L^2_t H^{1/2+}_x} \lesssim \varepsilon^2$$

while a small modification of the proof of (73) yields



|$_$ 



.linl 



[0,1] 



Now, from 



we have 

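$$\Box \phi = -2i(PA) \cdot \nabla_x \phi + O(\mathbf{N}_1) + O(\mathbf{N}_3).$$

But from the above estimates and the arguments in Section 5 we have

$$\|O(\mathbf{N}_1) + O(\mathbf{N}_3)\|_{H^{s-1,s-1}_{[0,1]}} \lesssim \varepsilon^3$$

and similarly from the above estimates and (64) we have

$$\|(PA) \cdot \nabla_x \phi - (PA_{\mathrm{lin}}) \cdot \nabla_x \phi_{\mathrm{lin}}\|_{H^{s-1,s-1}_{[0,1]}} \lesssim \varepsilon^3.$$

We thus conclude that

$$\|\Box \phi + 2i(PA_{\mathrm{lin}}) \cdot \nabla_x \phi_{\mathrm{lin}}\|_{H^{s-1,s-1}_{[0,1]}} \lesssim \varepsilon^3. \tag{136}$$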

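Now let $N \gg 1$ be a large parameter to be chosen later, and let $f_{\mathrm{lin}}$ be a solution
to the free wave equation with initial data $f[0]$ and Fourier transform supported
on the region $|\xi| \sim N$, and normalized so that $\|f[0]\|_{[H^{1-s}]} \lesssim 1$. Then by energy
estimates and (136) we conclude that

$$\left| \int_0^1 \int_{\mathbb{R}^3} \left[ \Box \phi + 2i(PA_{\mathrm{lin}}) \cdot \nabla_x \phi_{\mathrm{lin}} \right] f_{\mathrm{lin}} \, dx \, dt \right| \lesssim \varepsilon^3$$

while from (135) we have

$$\left| \int_0^1 \int_{\mathbb{R}^3} \Box \phi \, f_{\mathrm{lin}} \, dx \, dt \right| \lesssim N^{s-1}.$$

If $N$ is sufficiently large depending on $\varepsilon, s$, we conclude that

$$\left| \int_0^1 \int_{\mathbb{R}^3} \left[ (PA_{\mathrm{lin}}) \cdot \nabla_x \phi_{\mathrm{lin}} \right] f_{\mathrm{lin}} \, dx \, dt \right| \lesssim \varepsilon^3. \tag{137}$$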



We now construct $A[0]$, $\phi[0]$, $f[0]$ so that the above statement fails, yielding the
contradiction. Note that we need to obey the compatibility conditions (8), but
due to Proposition 3.1 we can always do so by selecting $A_0[0]$, so long as we have
the divergence-free condition $\operatorname{div} A[0] = 0$, in which case we can drop the Leray
projection $P$ in (137).

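We shall select $A[0]$, $\phi[0]$, $f[0]$ to have zero initial velocity:

$$\partial_t A(0) = \partial_t \phi(0) = \partial_t f(0) = 0.$$

Then using the Fourier transform, (137) becomes

$$\left| \int_0^1 \int_{\mathbb{R}^3} \int_{\mathbb{R}^3} \cos(t|\eta|) \cos(t|\xi|) \cos(t|\xi + \eta|) \; \xi \cdot \hat A(0, \eta) \, \hat\phi(0, \xi) \, \hat f(0, -\xi - \eta) \, d\xi \, d\eta \, dt \right| \lesssim \varepsilon^3. \tag{138}$$

We let $e_1, e_2, e_3$ be the standard basis for $\mathbb{R}^3$. We choose

$$\hat\phi(0, \xi) := \varepsilon N^{-s} 1_{|\xi - N e_1| \le 1}$$
$$\hat f(0, \xi) := \varepsilon N^{s-1} 1_{|\xi + N e_1| \le 1}$$
$$\hat A_1(0, \eta) := 1_{|\eta - e_2/10| \le 1/100}$$
$$\hat A_2(0, \eta) := -\frac{\eta_1}{\eta_2} 1_{|\eta - e_2/10| \le 1/100}$$
$$\hat A_3(0, \eta) := 0;$$

then $A$ is divergence-free, and we see for $\xi$, $\eta$ in the support of $\hat\phi(0)$, $\hat A(0)$ respectively
that

$$\frac{1}{100} \le \int_0^1 \cos(t|\eta|) \cos(t|\xi|) \cos(t|\xi + \eta|) \, dt \le 1$$

and that

$$\xi \cdot \hat A(0, \eta) \, \hat\phi(0, \xi) = (1 + O(\tfrac{1}{N})) \, \varepsilon N^{1-s} \, 1_{|\eta - e_2/10| \le 1/100} \, 1_{|\xi - N e_1| \le 1}$$

and so (138) simplifies to

$$\int_{\mathbb{R}^3} \int_{\mathbb{R}^3} 1_{|\eta - e_2/10| \le 1/100} \, 1_{|\xi - N e_1| \le 1} \, 1_{|\xi + \eta - N e_1| \le 1} \, d\xi \, d\eta \lesssim \varepsilon.$$

But this can easily be seen to be false for sufficiently small $\varepsilon$, giving the desired
contradiction. □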
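
The two-sided bound on the time integral used in the proof can be verified with product-to-sum identities. The sketch below assumes the supports $|\eta - e_2/10| \le 1/100$ and $|\xi - N e_1| \le 1$, and additionally $N \ge 10$ (any sufficiently large $N$ would do); the letters $a$, $b$, $c$ are shorthand introduced only for this check.

```latex
% Sketch only. Assumed supports: |\eta - e_2/10| \le 1/100 and |\xi - N e_1| \le 1, with N \ge 10.
% Shorthand: a := |\xi|, b := |\xi + \eta|, c := |\eta|, so that
%   c \le 0.11,   |a - b| \le c   (triangle inequality),   a + b - c \ge 2N - 3 > 0.
\begin{align*}
\cos(ta)\cos(tb) &= \tfrac12 \cos\bigl(t(a-b)\bigr) + \tfrac12 \cos\bigl(t(a+b)\bigr), \\
\int_0^1 \cos(tc)\cos(ta)\cos(tb)\,dt
  &= \underbrace{\tfrac12 \int_0^1 \cos(tc)\cos\bigl(t(a-b)\bigr)\,dt}_{\text{main term}}
   + \underbrace{\tfrac12 \int_0^1 \cos(tc)\cos\bigl(t(a+b)\bigr)\,dt}_{\text{oscillatory term}} .
\end{align*}
% Main term: for 0 \le t \le 1 both arguments lie in [-0.11, 0.11],
%   so the integrand is at least \cos^2(0.11) > 0.98.
% Oscillatory term: expand once more and use |\int_0^1 \cos(tx)\,dt| = |\sin x / x| \le 1/x for x > 0.
\begin{align*}
\text{main term} \ge 0.49, \qquad
|\text{oscillatory term}|
  \le \tfrac14 \Bigl( \tfrac{1}{a+b-c} + \tfrac{1}{a+b+c} \Bigr)
  \le \tfrac{1}{2(2N-3)} \le \tfrac{1}{34} .
\end{align*}
% Hence the integral is at least 0.49 - 1/34 > 1/100, and at most 1 since the integrand is bounded by 1.
```

The point of the construction is visible here: the low-frequency factor $A$ barely oscillates on the unit time interval, and the two high-frequency factors are nearly resonant, so no cancellation in time is available.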



Remark A.2. The above construction shows that $\phi[1] - \phi_{\mathrm{lin}}[1]$ can be made
arbitrarily large in the energy norm even when $\|\Phi\|_{[H^s]}$ is small. It is possible to
modify the above construction (using multiple frequency scales $N$) to in fact make
$\phi[1] - \phi_{\mathrm{lin}}[1]$ have infinite energy; we leave the details to the reader.